arXiv:1408.0629vl [math. CO] 4 Aug 2014 



Bounds for variables with few occurrences 
in conjunctive normal forms 

Plivcr Kullmann| [Xishnn Zhao| * 

Computer Science Department Institute of Logic and Cognition 
Swansea University Sun Yat-sen University 

Swansea, SA2 8PP, UK Guangzhou, 510275, P.R.C. 

August 5, 2014 



Abstract 

We investigate connections between SAT (the propositional satisfiability 
problem) and combinatorics, around the minimum degree of variables in vari- 
ous forms of redundancy- free boolean conjunctive normal forms (clause-sets). 

Let pvd(F) £ N for a clause-set F denote the minimum variable-degree, 
the minimum of the number of occurrences of a variable. A central result is 
the upper bound a(F) + 1 < yvd(F) < nM(cr(F)) < cr(F) + 1 + log 2 (a(F)) 
for lean clause-sets F £ LEAN in dependency on the surplus a (F) £ N. Lean 
clause-sets, defined as having no non-trivial autarkies (partial assignments 
satisfying some clauses and not touching the other clauses), generalise mini- 
mally unsatisfiable clause-sets, i.e., LEAN D MU. For the surplus we have 
a{F) < 6(F) = c(F) — n(F), using the deficiency 6(F) of clause-sets, the dif- 
ference between the number c(F) of clauses and the number n(F) of variables. 
nM(fc) £ N is the fc-th “non-Mersenne” number, skipping in the sequence of 
natural numbers all numbers of the form 2” — 1. As an application of the upper 
bound we obtain, that clause-sets F violating pvd(F) < nM(a(F)) must have 
a non-trivial autarky, so clauses can be removed satisfiability-equivalently. We 
obtain a polynomial time autarky reduction, but where it is open whether such 
an autarky itself can be found in polynomial time. 

We show that the upper bound is sharp, i.e., fivd(LEANs=k) = nM(fc) 
for all deficiencies k £ N, where yvd (L£ANs=k) is the maximum of pvd(F) 
over F £ LEANs=k ■ The determination of pvd(MUs=k) ='■ pnM(k) seems 
to be a much more involved question. We show that for k < 5 we have 
pnM(k) = nM(fc), but for k = 6 we have finM(k) = nM(fc) — 1. Moreover this 
correction by —1 causes further corrections by —1 for infinitely many other 
deficiencies, resulting in the upper-bound function nMi : N — > N, an instance 
of a generalised non-Mersenne function found by a novel recursion scheme. 

Extensive introductions, overviews, conclusions, examples and open prob- 
lems are provided. 
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1 Introduction 

In this work we aim at bringing together some aspects of combinatorics with the 
developing theory of SAT. We concentrate on degree considerations in “clause-sets” 
(conjunctive normal forms as set-systems), which can be considered as generalised 
hypergraphs, namely hypergraphs with “polarities” . The general goal is to develop 
an understanding of propositional (un) satisfiability, which corresponds for hyper- 
graphs to an understanding of (non-)2-colourabilitv. 

SAT, the prototypical NP-complete problem ([[12|), took a strong development 
in the past two decades also regarding (industrial) applications (see the handbook 
]?j for a recent overview). It is often mainly considered as belonging to complex- 
ity theory, algorithms and heuristics (with @ 0 the basic papers here), and 
finally implementations and experimentation (“SAT solvers”). “Understanding” 
SAT in a precise sense is considered to be impossible, and only various investiga- 
tions on random and approximation structures (including “islands of tractability” ) 
in general are deemed fruitful. We want to challenge this view, starting to build 
a new bridge, towards an understanding of unsatisfiability. We note here that 
understanding unsatisfiability seems easier than to understand satisfiability, since 
unsatisfiability means a form of completion, all assignments have been excluded as 
potential satisfying assignments (“models”), while satisfiability means a lack of such 
completion. More precisely, we aim at understanding minimal unsatisfiability, the 
building blocks of unsatisfiability — similar to critical colourability, here removal 
of any clause renders the clause-set satisfiable. 

A fundamental question, the subject of this study, is the existence of “simple” 
variables in clause-sets. “Simple” here means a variable occurring not very often. 
A major use of the existence of such variables is in inductive proofs of properties 
of minimally unsatisfiable clause-sets, using splitting on a variable to reduce n, the 
number of variables, to n — 1: here it is vital that we have control over the changes 
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imposed by the substitution, and so we want to split on a variable occurring as few 
times as possible. “Splitting” of a clause-set F on variable v means the consideration 
of the clause-sets (v — > 0) * F, (v — > 1} * F, that is, instantiating variable v by both 
truth values 0, 1. A feature of clause-sets is the closure under splitting, and splitting 
is a major tool for investigations into minimal unsatisfiability. 



1.1 Deficiency as the main structural parameter 

The definition of the class CCS of “clause-sets”, and of the class MU C CCS of 
“minimally unsatisfiable clause-sets” , can be quickly (and precisely) given as follows, 
using (just) natural numbers as “variables” : 

A “literal” x is an element of Z \ {0}. A “clause” C is a finite set of literals, 
such that there is no x £ C with — x £ C. Using — L := {—a; : x £ L} for sets L of 
literals, the “clash-freeness” condition for C becomes C D — C = 0. A “clause-set” 
F is a finite set of clauses, the set of all clause-sets is denoted by CCS. The most 
basic measurements for F £ CCS are: 



• the number c(F) := \F\ € No of clauses of F; 

• the number n(F) := |var(F)| £ No of variables of F, where var(F) is the set 
of v £ N (variables as positive integers) with {u, — z>} ft (J F ^ 0; 

• The “deficiency” 6(F) := c(F) — n(F) £ Z. This parameter is only informative 
when certain (weak) assumptions are made for F , and for general F the 
“maximal deficiency” 6*(F) := max^cf i5(U) £ No is to be used. 



A clause-set F is “satishable” if there exists a partial assignment (p, which here in 
this introduction is just a clause, such that ip fi D ^ 0 for all D £ The set of 
all satishable clause-sets is SAT C CCS , the set of all unsatisfiable clause-sets is 
US AT := CCS \ SAT. Finally MU C US AT is the set of F £ USAT such that 
for all C £ F we have F \ {C} £ SAT. 

The background for the investigations of this report is the enterprise of classi- 
fying F £ MU in depende ncy on 5(F). The basic facts are S*(F) = 6(F) (as will 
be discussed in Subsection [11]) , and the well-known 5(F) > 1, as first shown in ||. 
For 6(F) = 1 the structure is completely known (i B§; see Example [I~2| ), for 
5(F) = 2 the structure after reduction of singular variables (occurring in one sign 
only once) is known ( 1 42 ] ; see Example 3.2), while for 5(F) £ {3,4} only basic cases 
have been classified (|9£j). 

The starting point of our investigation is Lemma C.2 in jif|, where it is shown 
that F £ MU with n(F) > 0 must have a variable v £ var (F) with at most 5(F) 
positive and at most 6(F) negative occurrences; we write this as ldi?(u) < 6(F) and 
ldjr ( — v) < 5(F) , using the notion of literal degrees (the number of occurrences of 
the literal), where for a literal x its degree is 



\d F (x) := | {C£F:x£ C}\ £ N 0 . 



Thus we have vd F (v) < 2 6(F), using the variable degree 
vd F (v) := ld^(u) + ldf(-«) £ Nq. 



Using the minimum variable degree (min-var-degree) 

pvd(F) := min vdF(u) £ N 

p£var (F) 

^The clause ip is the set of satisfied literals of the corresponding “partial assignment”. This def- 
inition of “satisfying assignments” , via clauses intersecting every clause of F t generalises transver- 
sals of hypergraphs, by taking complementation into account (<p does not contain clashes). 
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of F with n(F) > 0, the upper bounds becomes pvd(F) < 2 8(F). A main theme 
of this report is the consideration of p.vd(MUs=k) £ N for k £ N, the maxi- 
mum of fivd(F) for F £ MU with 6(F) = k. The upper bound now becomes 
pvd(MUs-k) < 2 k. 

We show a sharper bound on /rvd(F'), namely we show that the worst-cases 
ldF(w), ldF(— v) < 6(F) can not occur at the same time (for a suitable variable), 
but actually ldp’(u) + ldp’(u) — 6(F) only grows logarithmically in 6(F). The really 
interesting aspect here is the precise determination of pvd(N iUs=k), and we inves- 
tigate the (elementary) number-theoretic function nM(fc), which yields the upper 
bound pvd(MUs=k) < nM(fc) for all k £ N, where the function nM : N — > N fulfils 
k + |k>g 2 (fc + 1)J < nM(fe) < k + 1 + [log 2 (fc)J for k £ N. 



1.2 Refining deficiency by surplus 

After having settled this basic min-var-degree upper bound for MUs=k, we show a 
sharper bound on fivd(F) for a larger class of clause-sets F: 

• The larger class of clause-sets considered is the class LEAN of lean clause-sets 
(introduced in |f30| ), which are clause-sets having no non-trivial autarky. For 
an overview on the theory of minimally unsatisfiable clause-sets and on the 
theory of autarkies see Q. LEAN C CCS is the set of F £ CCS such that 
there is no partial assignment ip (a “non-trivial autarky” ) with the properties 

— for every clause D £ F with —p n D T 0 we have ip ft D ^ 0 (note that 
this generalises the satisfaction criterion); 

— there exists v £ var (F) with {v, — 1 >} fl ip ^ 0. 

Note LEAN fl SAT = T, where T := 0 £ CLS is the empty clause-set (the 
standard satisfiable clause-set). 



• The deficiency 8(F) £ Z is strengthened by the surplus <r(F) £ Z, defined in 
case of n(F) > 0 as follows. 

Consider the bipartite clause- variable graph of F (generalising the incidence 
graph of a hypergraph), with the clauses C £ F on one side of the biparti- 
tion, and the variables v £ var (F) on the other side, and an edge between 
v and C if {i>, — w} fl C T 0- The “expansion” of a set 0 ^ F C var (F) 
of variables is |T(V) | — |F|, where T(F) is the set of neighbours of V (inci- 
dent clauses), and the surplus then is the minimum expansion, i.e. , u(F) = 
min0^ycvar(F) |T(F)| - \V\. 

In the terminology of |73], Section 1.3], 6*(F) is the deficiency of the bipartite 
clause-variable graph (with bipartition (F, var(F)), while o(F) is the surplus 
of the bipartite variable-clause graph (with bipartition (var (F),F)). 

Note that by considering V = var(F) we have a(F) < 6(F ) , and by consider- 
ing V = {w} for v £ var (F) we get a(F) < pvd(F) — 1. 

We have a (F) > 1 for F £ LEAN with n(F) > 0 ( jHT], Lemma 7.7]), general- 
ising the basic fact 6(F) > 1 for F £ MU. 



Now a central result of this report (Theorem 9.8) is 



pvd(F) < nM(cr(F)) 

for F £ LEAN with n(F) > 0. As an application we obtain (Theorem S), 
that via removing satisfiability-equivalently some clauses (via some autarky), we 
can reduce every (multi-)clause-set F in polynomial time to a (multi-)clause-set F' 
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containing a variable occurring with degree at most <r(F') + 1 + log 2 (cr(F')). It is 
an open problem whether such an autarky can be found in polynomial time (for 
arbitrary clause-sets F); we conjecture (Conjecture 10.3) that this is possible. 

We also show sharpness of the upper bound, i.e., nvd(££AJ\fs=k) = nM(fc) 
for all k £ N, in Corollary 9.13 (proving Conjecture 23 from the conference version 
p|), which indeed holds for every class of clause-sets between VMU , i.e., “variable- 
minimally unsatisfiable clause-sets” as introduced in ]ll|], and ££AJ\f; the definition 
of VMU is as the set of F £ US AT such that for all F' C F with var(F') C var (F) 
we have F' £ SAT. 

We then come back to the special case of minimal unsatisfiability. Here things 
turn out to be much more complicated, and the numbers := /ivd(MUs=k) £ 

N for k £ N, the guaranteed minimum variable degrees for minimally unsatisfiable 
clause-sets of deficiency k, seem to be very complicated (and very interesting) quan- 
tities. We proof the sharpened bound /xnM(fc) < nMi(fc), which improves on nM(fc) 
for infinitely many k. 

According to the goal of bringing different communities together, we try to 
provide and explain much of the relevant background, so that this report is mostly 
self-contained, and the results cited from the literature can be treated as black- 
boxes. 



1.3 Some basic intuitions about the upper bound nM 

As already mentioned, the function nM : N — >• N is strictly increasing with range 

nM(N) = N\{2" — 1 : n £ N} = {2, 4,5,6, 8 ,..., 14, 16,17,...}. 

We show jjwd(C£ AAf s=k ) = nM(fc) for dehciencies k £ N, that is, every lean clause- 
set F with n(F) > 0 contains a variable v £ var(F) with vd^ (v) < nM(c5(F)), and 
for every deficiency k > 1 there are lean clause-sets F with /ivd(F) = nM(<5(F)). 

The underlined values 2, 6, 14, ... , which have the form 2” — 2 for n £ N, are the 
function values at the “jump positions” 1, 4, 11, ... , which are of the form 2 n — n— 1 
for n > 2 (where the function values changes by +2, while otherwise it changes 
by +1 for an increment of the argument). This basic structure of nM can be 
motivated by the following constructions of F £ MU with “high” min-var-degree; 
indeed these considerations only concern the lower bounds, given by appropriate 
constructions, while the arithmetic nature of nM(fc) rests on different considerations, 
but for the deficiencies considered here, lower and upper bounds are equal, and the 
lower bounds are easier to understand here. 

The basic clause-sets are the A n for n £ No, which consist of all 2 n sets (clauses) 
of numbers ±1, . . . , ±n, using the natural numbers 1, . . . , n as variables. So Aq = 
{0}, A 1 = {{-1}, {1}}, A 2 = {{1, 2}, {-1, 2}, {1, -2}, {-1, -2}} and so on. It is 
easy to see that we have A n £ MU with n(A n ) = n, c(A n ) = 2 n = jjtvd(A n ), and 
8 (A„) = 2 n — n. We will see that the A n have the largest possible min-var-degree 
2 n for given deficiency 2 n — n, and we also have nM(2™ — n) = 2 n for n £ N. These 
deficiencies k = 2 n — n (numerical values are 1, 2, 5, 12, ... ) are the positions directly 
after the jump positions (excluding deficiency k = 1 as a special case). 

How can we obtain from that more clause-sets in MU with high min-var-degree? 
Consider A 3 : we have e.g. (1, 2, 3}, {1, 2, —3} £ A 3 ; now logically these two clauses 
are equivalent to {1,2} (i.e., we have the same satisfying assignments; technically, 
a “strict full subsumption resolution” is performed), and we obtain A' 3 := (A 3 \ 
{{1, 2, 3}, {1, 2, —3}}) U {{1, 2}} £ MU. Performing this process in general, using 
{1, . . . , n}, {1, . . . , n — 1, — n} £ A n , yields A' n £ MU for n > 2, with n(A' n ) = n, 
c(A ' n ) = 2" — 1, 8(A' n ) = 2 n — n — 1 , and fivd(A' n ) = 2" — 2 (the (single) variable 
with minimum occurrences is n). These deficiencies are precisely the jump positions 
2" — n — 1, and accordingly we have nM(2" — n — 1) = 2 n — 2. 
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Performing the same trick again to A ' 3 , we might replace {— 1,2, 3}, {-1,-2, 3} G 
A' n by { — 1,3}, obtaining A 3 G A4ZJ. Again for general n > 3 we get A" G A47/, 
n(A") = n, c(A") = 2" — 2, <5(A") = 2 n — n — 2, and /rvd(A") = 2 n — 3; note 
here the crucial difference, that the min-var-degree has only been changed by — 1. 
The reason is that there are two variables now with minimum occurrences, namely 
n — 1, n, where the degree of variable n changed first by —2, then by —1, while for 
variable n — 1 the degree first changed by —1, and then by —2 (and for the other 
variables 1, . . . , n — 2 we had degree changes by —1, —1). 

Now one might imagine this process of strict full subsumption resolution contin- 
uing until deficiency 2 n_1 — (n — 1) + 1, always with change of the min-var-degree 
by —1, just before the deficiency of the previous A n _ 1 — this would yield the 
function nM. However the combinatorial reality is more complicated, and as we 
prove in this report (Section 111), at least we can not get until 2 n 1 — (n — 1) + 1 
for n > 4 (in effect), that is, at these deficiencies k = 6,13,28,... we have 
pvd(MUs-k) < nMi(fc) = nM(fc) — lj^] 



1.4 Related work on M.U 



A general overview on minimally unsatisfiable clause-sets (also “minimal unsatis- 
fiable clause-sets/formulas”, or “MU”) is J43|; later developments are in [|?|, 
(generalisations to non-boolean clause-sets) and in |l4, 65 (studying “singular DP- 



reduction” , the elimination of variables which occur in one sign only once) . 

Two early papers on the complexity aspects are ||l|, |8(J, who introduced the 
complexity class D p and showed that the decision a F G A iU ?” with input 
F G CCS is complete for this class. Another important early paper is |j), which 
showed 5(F) > 1 for F G MU, where the notion of “deficiency” was introduced by 
|25| . Furthermore || showed polytime-decision of the sub-class SMU$=i C MU&=\ 
(called “strongly minimal unsatisfiable” there), where SMU C MU is the set of 
F G US AT such that for all C G F and all x G Z \ {0} with {x, — x} n C = 0 holds 
(F \ {C}) U{CU {x}} G SAT , that is, adding any literal to any clause renders the 
clause-set satisfiable. We use the terminology “saturated minimally unsatisfiable” 
introduced in ^4|, where the important connection to splitting was introduced, and 
a simpler proof of 5(F) > 1 for F G A iU was given. Just for this introduction we 
handle “partial assignments” via clauses <p (containing the satisfied literals; thus 
—<p is the set of falsified literals), so for a literal x the partial assignment ( x — > 0) 
is given by {— x}, while (x — > 1) is given by {x}. The application of ip to F G CCS 
is defined as 

p * F := {D \ —C :DGFACnD = 0}G CCS , 

that is, removing first the satisfied clauses from F, and then the falsified literals from 
the remaining clauses. Now for F G CCS holds F G SM.U iff for all iGZ \ {0} 
holds (x -> 1 )* F G A iU (the “only if” -direction was shown in Jm|, the “in- 
direction in [|49|). Due to this property plus the property, that every F G MU can 
be “saturated” by adding literals to clauses, the class SMU is an important helping 
class for investigations into MU via the splitting method, splitting up F £ MU 
into (v — > 0) * F and (v 1) * F for selected variables v. 

We have already mentioned the literature concerned with characterising the 
classes MUg=k (and subclasses) for small deficiencies k < 4. Less ambitious is the 
goal of poly time decision of these classes: the problem was raised in Q, and has 
been solved via two independent approaches in Q and [|d| (indeed establishing 
polytime SAT decision for inputs F G CCS and fixed 5*(F)), later strengthened in 



2 ) We do not apply the above method for gaining, lower bounds as far as we can, but only as 
needed in this report; see the end of Subsection L2.1 for some further remarks. 
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93] (showing that SAT decision is even fixed-parameter tractable in S*(F); see also 
57] for generalisations and simplifications). 



1.4.1 MUS 



As we have already mentioned, we consider MU as the “primal” building block 
for understanding unsatisfiability. In general an unsatisfiable clause-set can contain 
many minimally unsatisfiable sub-clause-sets, called “MUSs” . The task of enumer- 
ating all of them or at least some “good” ones is also of practical importance, to 
extract more information on the “causes” of unsatisfiability. A recent overview is 
w hile a clean approach to enumerate all MUSs, via hypergraph transversals, is 
in |71|] (the earliest appearance of the underlying observation seems 0. Theorem 2] ; 
compare also |i8| Subsection 4.3] for generalisations of the fundamental approach). 
See also pi for a reflection on various types of such sub-clause-sets, and on the 



connection to autarky theory (compare Subsection 1.6.3). 



1.4.2 Tovey’s problem (uniform clause-sets) 

This report appears to be the first systematic study of the problem of minimum 
variable occurrences / degrees in minimally unsatisfiable clause-sets and generali- 
sations, in dependency on the deficiency — asking for the existence of a variable 
occurring “infrequently” in general, or for extremal examples where all variables 
occur not infrequently. The “dual” problem is to consider maximum variable oc- 
currences / degrees — asking for the existence of a variable occurring frequently in 
general, or for extremal examples where all variables occur not frequently. More 
precisely, the maximum variable degree is 

vvd(F) := max vd^(rd £ N, 
vGvar (F) 

for n(F) > 0, while for a class C C CCS of clause-sets, the quantity i'vd(C) (to be 
studied) is the minimum of wd (F) for F £ C. 

This problem has been well-studied for p-uniform minimally unsatisfiable clause- 
sets, starting with 0 MM- 0 We denote by p-CCS C CCS for p £ No the set of 
all F £ CCS with V G £ CCS : |C| < p, while by UCCS C CCS we denote the set of 
all uniform clause-sets, i.e., those F £ CCS such that for C, D £ F, C ^ D, holds 
\C\ = \D\. Finally p-UCCS := p-CCS nUCCS and p-UMU := p-UCCSnMU. Now 
the basic fact is 

^vd [p-UMU) >p + 1 

for p £ N (Q, generalised in ]^8| Corollary 7.3]). Trivially vvd(l-UMU) = 2, 
and easily one sees v\d(2-UMU) = 3, while by Q holds vvd(?>-UMU ) = 4. As 
reported in |Q, we have vvd{4-UMU) = 5, and these are all known precise values 
of vvd(p-UMU) (where the notation f(p) := vvd(p-UMU) — 1 was introduced in 
Q). In Q it was observed that extremal examples might be found in MUs= i, and 
this work was recently extended in [^8| , establishing the asymptotically tight bound 
linip^oo vvd(p-UMU) = 1 (where indeed p-UMU fl MUs=i is considered). 

In our setting, studying the classes MUg=k, the max-var-degree is not very 
relevant, since we have ^vd(AI^ = i) = 2, while isvd(MUs=k) = 3 for k > 2. This 
can be seen as follows: As already noticed in |94] , there is a poly-time transformation 
from CCS to the class CCS{ 1,2) C CCS , consisting of those F £ CCS where for 
every variable v £ var(F) we have ldi?(u) = 1 and ld^(— v) < 2. Namely if there 

3 ^We remark that typically in the literature the connections to minimally unsatisfiable clause- 
sets are not emphasised, but it is clear that when considering (uniform) unsatisfiable clause-sets 
with a maximum variable degree as small as possible, then one can restrict attention to (uniform) 
minimally unsatisfiable clause-sets (as worst- cases ) . 



is a literal x and two clauses C,D G F with x G C PI D, then we can introduce a 
new variable v, replace x in C,D by v, and add the new clause {— u,a;}, obtaining 
F' . We study such extensions under the name of “singular DP-extension”, but 
it is also easy to see directly that F' is satisfiable iff F is, that F' is minimally 
unsatisfiable iff F is, and that 6(F') = 6(F). By repeating this transformation, we 
obtain t 1,2 : CCS CCS( 1, 2). So for F G MU we get t 1,2 (F) G MU n CCS(1, 2) 
with 6(t(F)) = 6(F). Whence for all k G N we have vvd(MUs=k) < 3. Now 
trivially u\d(MUs=i) = 2 due to {{1}, { — 1}} G MUs=\. On the other hand, if 
for F G MU holds ^vd(F) < 2 (thus vvd(F) = 2), then via so-called singular 
DP-reduction this clause-set can be reduced to {_L}, whence F G MU$=\ (this is 
well-known; compare Example |3.2| later). 

So for the study of the max-var-degree, the uniformity restriction seems essential. 
This is similar to many investigations into (colour-) critical hypergraphs (discussed 
in Subsection 1.6.1 below), where uniformity is a crucial assumption, and the clause- 
length p is the main parameter. For investigations into the case of uniform (general) 
clause-sets, where clauses share at most one variable, see |^| |88|. The number of 
clauses in F G p-UMU has been studied in |||, showing that for p = 2 holds 
c(F) < 4n — 2, while for p > 3 there are F with c(F) = Q(n(F) p ). Finally, the 
number of conflicts (clashes) in F G p-UMU is considered in (g9|, and for a review 
of the use of the Lovasz Local Lemma in this context see [^7| . 

In contrast, for the study of the minimum variable degree as in this report, in 
dependency on the deficiency, the restriction to uniformity seems not interesting, 
and is also not needed, but unrestricted clause-sets are considered. We remark 
that for every p G N, p > 3, there is a polytime translation t p : CCS p-UCCS , 
such that t p (F) is satisfiable iff F is, t p (F) is minimally unsatisfiable iff F is, and 
6(t p (F)) = 6(F). This works by replacing clauses C with \C\ < p by clauses 
C U {t>}, C U {— v} for some new variable v (in the MU-case we will call this a “non- 
strict full subsumption extension”), and by replacing clauses C with |C| > C by 
clauses C' U {v}, C" U {— w} for some new variable v and choosing clauses C' , C" 
with C = C' U C" and \C'\ = p — 1, \C"\ > p — 1 (in the MU-case again we have 
a singular DP-extension). But the transformation t p appear to be useless, since it 
completely garbles the structure of F. 

We conclude these remarks on p-uniform clause-sets by the observation, that 
for p > 4 the instances involved above become quickly very big, and only methods 
from random analysis are available (which by nature are very rough) . It seems that 
these considerations do not have practical relevance. In contrast, we consider all 
minimal unsatisfiable clause-sets (and more), that is, the deficiency does not filter 
out clause-sets, but only organises them in layers. And for a wide range of deficiency 
values, say, k = 1 , . . . , 10000, there are interesting and relevant examples. 



1.5 Autarkies 



An important tool, used in this report to go beyond MU, is the theory of autarkies, 
which also provides a strong link to various areas of combinatorics; the relations to 
hypergraph colouring will be discussed in Subsection 1.6.3 . Recall that a partial 
assignment <p is an autarky for F G CCS iff every clause C G F touched by p 
(i.e., p fl (C U —C) / 0) satisfies C (i.e., ipflC ^ 0), which is equivalent to 
V F' C F : (p * F' C F' . Autarkies were introduced in [ff8| for improved worst-case 
upper bounds for SAT decision, applying that for an autarky <p obviously ip * F is 
sat-equivalent to F. For a recent overview see p3[. 



Autarky reduction. Autarky reduction , the reduction of F G CCS to tp * F G 
CCS for a non-trivial autarky tp, is an essential concept, algorithmically as well as 
for theoretical understanding; see Jh| Subsection 11.10] for an overview on finding 
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autarkies. If we reduce all autarkies, then we obtain the (unique) lean kernel of F. If 
there are no non-trivial autarkies, then we have a lean clause-sets, i.e., F £ C£AJ\f, 
as already mentioned in Subsection 1.2; the concept was introduced in |i(|, and |t3|, 
Subsection 11.8.3] contains more information. The lean kernel of F is the largest 
lean sub-clause-set of a clause-set, that is, (J{F' Q F : F' £ C£AJ\f}; for a recent 
paper on the computation of the lean kernel see j7f| . 

The decision of leanness is coNP-complete, and so consideration of special au- 
tarkies is of interest; actually, these considerations are not just “algorithmic hacks” , 
but in a sense represent various areas of combinatorics (for example matching the- 
ory) via “autarky systems” . 



Autarky systems. The notion of an “autarky system”, as a selection of special 
autarkies with similar good properties as general autarkies, was introduced in |5l| , 
partially further expanded in ]5(|, and overviewed in ][l3| Subsection 11.11]. 

The starting point for an autarky system is to single out a restricted notion of 
autarky. This restricted autarky notion implies a restricted satisfiability notion, 
namely clause-sets satisfiable via autarky reduction, using only these special au- 
tarkies. This is indeed equivalent for “normal autarky systems” to the clause-set 
being satisfiable by a single special autarky.^] Often the general autarkies of the 
system can be derived from the extreme case of satisfiability through such autarkies. 
For arbitrary autarky systems also the notions “minimal unsatisfiability” and “lean” 
are defined, and are central properties. 

Balanced autarkies are an example of a rather general autarky system, the ba- 
sis for autarkies for hypergraph colouring; here for an autarky, touched clauses 
need not only have some satisfied literal, but also some falsified literal. The corre- 
sponding satisfiability notion is “NAE-satisfiability” , and will be further discussed 



in Subsection 1.6.3 



Matching autarkies. The autarky system especially of importance in this report, 
besides the full system, is that of matching autarkies; for a short introduction see 
fdiil , Subsection 11.11.2]. They yield the set MC£AJ\f D C£AM of matching-lean 
clause-sets , and the set A4SAT C SAT of matching-satisfiable clause-sets (called 
“matched clause-sets” in psj): 

• A matching autarky for F £ CCS is an autarky p for F such that for all 
C £ F touched by p one can select xc C C with x £ <p such that the 
underlying variables var(a;c) are pairwise different. 

• We have F £ MSAT VF' C F : 6(F) < 0, i.e., 6*(F) = 0. 

• And F £ MC£AM ^VTcf: 6(F') < 6(F). 

• Thus for F £ MC£AN holds 6*(F) = 5(F) , and for F ^ T holds 6(F) > 1 
(note <5(T) = 0), a vast generalisation of this fact for JAU. 

• More strongly, we have for F ^ T that F £ M.CSAN a (F) > 1. 

• Every F £ CCS has a largest matching-lean sub-clause-set, the matching-lean 
kernel , namely (J{F' Q F : F' £ A4C£AAf}, computable in polynomial time 
(for example via reduction by matching autarkies). 

4 ) “Normal autarky systems” were called “strong autarky systems” in [fHj. Section 8]. 
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Linear autarkies. A stronger autarky system than matching autarkies is given 
by “linear autarkies” ; we will not use them for the results of this report, but they are 
an important link to combinatorics, and so we discuss them here; see [ fiTj[ Subsection 
11.11.3] for a more elaborated introduction. “Simple linear autarkies” for F £ CCS 
have been introduced in J5(|, based on linear programming. For F £ CCS we 
consider the clause- variable matrix M(F ), which is a c(F ) x n(F) matrix over R 
(or over Q for computational purposes), which encodes in the rows the clauses and 
in the columns the variables, by using 0 for absence of the variable, and ±1 for 
positive resp. negative sign. Now the simple linear autarkies <p are obtained from 
solutions x £ of M{F) ■ x > 0, by translating the values Xj, where the indices 

i correspond to the variables of F, into “unassigned” for x- L = 0, “true” (i.e., 1) for 
Xi > 0, and “false” (i.e., 0) for Xi < 0. It is an easy exercise to see that this yields 
indeed autarkies. We have a non-trivial simple linear autarky iff M(F) ■ x > 0 has 
a non-trivial solution. We obtain the classes 



• CCS AN of “linearly lean clause-sets” (not having a non-trivial simple linear 
autarky), with CSAN C CCS AN C MCSAN\ 

• CSAT of “linearly satisfiable clause-sets” (satisfiable by a sequence of simple 
linear autarkies), with MS AT C CSAT C SAT. 



Linear autarkies , as introduced in & are obtained from simple linear autarkies 
by composition, corresponding to iterated reduction by simple linear autarkies; 
simple linear autarkies yield an autarky system, while linear autarkies yield a normal 
autarky system. The main point here is, that the reduction to the linearly-lean 
kernel can be done by a single linear autarky, and linearly satisfiable clause-sets are 
satisfiable by a single linear autarky. In Subsection 1.6.3 we discuss the special case 
of “balanced linear autarkies” . 



1.6 Connections to combinatorics 



We now discuss the connections between SAT and combinatorics in a wider context 
than the degree considerations of this report, concentrating on aspects related to 
minimal unsatisfiability and autarkies (if one is only interested in the results of this 
report, then these discussions may be ignored). A general source on SAT is the 
handbook j7j; a classical connection to combinatorics, random satisfiability, is dis- 
cussed in Chapter 8 (J2j) there, and of further general interest to combinatorics is 
Chapter 10 (Q) on symmetry (group theory), Chapter 13 (Q) on fixed-parameter 
tractable problems (for example treewidth and related notions), and Chapter 17 
(| 10C]) on the handling of various combinatorial designs to SAT solving, for ex- 
ample from Ramsey theory. Ramsey theory has strong connections to hypergraph 
colouring, which we discuss next; we mention, that applying SAT solving to solve 
hypergraph colouring problems is a powerful tool, and a recent overview can be 
found in Q (where especially van-der-Waerden numbers are discussed). 



1.6.1 Hypergraph colouring 

Hypergraph-colouring, especially 2-colouring, and SAT are closely connected; see 
0. Section 5] for a general introduction and overview on hypergraph colouring 
(from the combinatorial point of view), while a monograph is given with | jf0[ . An 
overview especially on the question of the minimum number of hyperedges for a 
given number of vertices in non-fc-colourable hypergraphs is given in |l6|] . 



Hypergraphs. For this introduction, a hypergraph G is a finite set of finite sub- 
sets of Z; so G itself is the set of hyperedges, i.e., E(G) := G, while (J G is the 
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set of vertices, i.e., V(G) := (J G. The set of all hypergraphs is denoted by 'H.yV. 
Let the deficiency be <5 h(G) := \E{G)\ — |V(G)|. Note that clause-sets are spe- 
cial hypergraphs {CCS C 'H.yV ), but their deficiency is defined differently. Hy- 
pergraphs G with 6n(G) = 0 are called square hypergraphs. Special hypergraphs 
are the positive clause-sets, and the set of all positive clause-sets is denoted by 
VCCS := {F G CCS : [j F C N} = {G G UyV : V(G) C N}. For F G VCCS 
we have 6{F) = 6u{F)- : obviously every hypergraph can be renamed to a positive 
clause-set. From general clause-sets F G CCS we obtain two hypergraphs: 

• F itself is a hypergraph (breaking the link between positive and negative 
literals, which are now just unrelated vertices). 

We note that we could have allows CCS = HyV, by allowing tautological 
clauses (i.e., clauses containing clashing literals) and self-complementary liter- 
als (—0 = 0). In certain contexts allowing such degenerations has advantages, 
but in our context is seems best to ban them (for example so we have a direct 
correspondence between clauses and partial assignments) . 

• The “variable-hypergraph” of F is {var(C) : C G F} G VCCS. This formation 
for example is important to apply methods from matching theory. 

For positive clause-sets both formations collapse to the identity, and we treat posi- 
tive clause-sets as representing (general) hypergraphs by (special) clause-sets. 

Colouring. A fc-colouring for k G No of G is a map / : V { G ) —?■ {0, . . . , k— 1} such 
that for all H G G there are x,y G H with f{x) / /(y); G is called fc-colourable if 
there exists a fc-colouring of G. Note that if there are H G G with \H\ < 1, then 
G is not fc-colourable for any fc. A hypergraph G is critically k-colourable if G is 
fc-colourable, not fc — 1-colourable, but for all H G G the hypergraph G\{H} is fc — T 
colourable. In the SAT-context there is no need to discard hyperedges containing 
at most one vertex, and then minimally non-k-colourability is more appropriate, 
that is G is not fc-colourable (possibly not colourable at all), while after removal of 
any hyperedge G becomes fc-colourable. The set of all minimally non-fc-colourable 
hypergraphs is denoted by MMC k C HyV for fc G No- We have {0},{{:r}} S 
MMC k for all fc G N 0 and iG2. 

We are especially interested in MAfC 2 . For G G MNC 2 holds 6h(G) > 0, as 
shown in |h|, and so we can consider the sets MAfC 2 a=k for deficiencies fc G No (all 
minimally non-2-colourable hypergraphs of deficiency (exactly) fc).^] The famous 
problem of deciding in polynomial time, whether a directed graph contains an even 
cycle, is equivalent to the problem of deciding “F G MAfC 2 H=0 ?” for F G %yV 
(via simple transformations), and this problem was finally solved in Q, [76|. It 
was conjectured in Q, that for all fc G N 0 the classes A 4J\fC 2 H=k are decidable 
in polynomial time (see also |43|, Conjecture 11.12.1]). More on this in Subsection 



A iAfCg H=0 (i.e., where all hyperedges have the same length). 

Translating hypergraphs into clause-sets. For a positive hypergraph G G 
VCCS we obtain the translation of 2-colouring to satisfiability via 

F 2 (G) :=GU{-H : H gG} GCCS. 

For a general discussion of such translations, also considering more colours, see |^8| , 
Subsection 1.2]. A hypergraph G G VCCS is 2-colourable iff F 2 {G) is satisfiable, and 

Indeed in jv|. Corollary 8.2] it is shown <5 h(G) > 0 for all G G A4J\fC k for fc > 2, as a simple 
application of tne autarky method; note that for G := {{1, . . . , n}} G MJ\fC k for fc < 1 and n > 2 
holds 5h(G) = 1 — n < 0. 



1.6.4. In pj one finds more information on vertex degrees in uniform elements of 
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G is minimally non-2-colourable iff F 2 (G) is minimally unsatisfiable, i.e., F 2 (G) G 
MU G G MMC 2 (this is easy to prove, and a special case of 1 55 , Lemma 8.1]). 
Regarding the deficiency we have 6(F 2 (G)) = <5r(G) + |F(G')| for 0 ^ G, and thus 
e.g. F 2 (MMC 2 n=0 fl VCCS ) is not contained in any MUs=k for some k € N. 

A slight generalisation of the image F 2 (VCCS) under this translation is the 
class of complementation-invariant clause-sets F G CCS , characterised by G G F •o- 
— G G F for clauses G, as introduced in [|b) (see also [43], Subsection 11.4.5]), while 
the image F 2 (VCCS) is the set of complementation-invariant PN-clause-sets, that 
is, clause-sets F where eve ry cla use G G F is positive (i.e., G C N) or negative 
(— G C N). See Subsection 1.6.4 for how “autarkies”, as considered on F 2 (G), can 
help understanding G. 



Translating clause-sets into hypergraphs. In the other direction a translation 
was provided in [72| . For F G CCS let 

e(F) := {C U {0} : G G F} U {{a, -v} : v G var(F)} G UyV. 



The hypergraph e(F) is 2-colourable iff F is satisfiable, and F is minimally unsat- 
isfiable iff e(F) is minimally non-2-colourable, i.e., e(F) G MMC 2 <*=> F G MU (the 
direction of the latter statement is stated in the proof of Theorem 3 in Q, 
the other direction is (also) very easy). Furthermore 6n(e(F)) = 6(F) — 1. Thus e 
embeds the classes MUs=k into MAfCg k _i, which motivates the conjecture, that 
all MMC 2 n=k for k G N 0 are polytime decidable, as a strengthening of the polytime 
decision of the MU$=k for k G N (recall Subsection 

We remark that via this embedding e we obtain a proof of 6(F) > 1 for F G MU 
from <5 h(G) > 0 for G G MMC (this is one of the proofs given in Q). In || also an 
alternative proof of <5 r(G) > 0 is given, based on matching the ory, p lus one further 
proof of 6(F) > 1, using linear algebra, as in |h|. In Subsection 1.6.3 we will further 
comment on these proofs, as they are unfolded in the theory of “autarkies”. 

We also remark, that the hypergraph class e(MUs=i) C MMC 2 H=0 has the 
property, that every hypergraph in it different from {{0}} has a vertex of degree 
2 (since every F G MUs-i different from {0} has a variable of degree 2). More 
generally, for all k G N every hypergraph in e(MUs=k) \ {{{0}}} C MMC | H=fe _ 1 
has a vertex of degree at most k + 1. We do not know whether the minimum 
vertex-degrees of general G G MMC 2 H=k for any (fixed) k G No are bounded. 



1.6.2 Hypergraph transversals 

For G G HyV let Tr(G) G HyV, the transversal hypergraph of G, be defined as the 
set of all minimal T C V(G) such that T fl H ^ 0 for all FL G G. The Transversal 
Hypergraph Problem is the computational problem, given G, G' G TtyV, to decide 
whether Tr(G') = G' holds. Equivalently, the input is G G TtyV, and it is to be 
decided whether G = Tr(G) holds (obviously this is a special case of the Transversal 
Hypergraph Problem, and by a polynomial-time translation the general case can 
be reduced to it). For an overview on this important problem and its many guises 
see p^ ]. It is known that the problem is solvable in quasi-polynomial time, and the 
long outstanding problem is whether it can be solved in polynomial time. 

An intersecting hypergraph is a hypergraph G G TtyV, such that for H,H' G G 
with H ^ H' holds H fl H' ^0, the class of all intersecting hypergraphs is denoted 
by TTLyV C TtyV. By definition we have G C Tr(G) for G G THyV, and it is 
not hard to see that for G G TTLyV holds G G MMC 2 iff Tr(G) = G. Thus the 
Transversal Hypergraph Problem is equivalent to the problem, deciding whether 
an intersecting hypergraph is minimally non- 2-colourable. The natural question 
arises for the decision of the classes (MMC CM'Hy'P)s H =k for k G No- The case 
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k = 0 has been handled in po|] , indeed not just deciding the class in polynomial 
time, but efficiently classifying the elements. The cases k > 1 appear to be open, 
and whether decision is possible in polynomial time for fixed k , or is even fixed- 
parameter tractable (fpt) in k, is an interesting test case for the general Hypergraph 
Transversal Problem, as well as it is relevant for the understanding of minimally 
non-2-colourable hypergraphs. 

The translation of intersecting hypergraphs G G THyV into clause-sets F 2 (G) G 
CCS yields also a natural and interesting class of clause-sets. Bihitting clause- 
sets, introduced in |2(], Subsection 4.2], are those F G CCS where F',F" C F 
with F' U F" = F, F' n F" = 0 exist, such that for all C' G F ' , C" G F" holds 
C' fl — C" ^ 0, while F',F " itself are clash-free (i.e., QJF') D — (1JF') = 0) and 
(U F")n — (U F") = 0). Obviously, the images under F 2 of intersecting hypergraphs 
are precisely the bihitting complementation- invariant PN-clause-sets (i.e., the set of 
bihitting clause-sets in the image of F 2 ), and deciding their minimal unsatisfiability 
is thus another manifestation of the Hypergraph Transversal Problem (directly re- 
lated to the decision “G = Tr(G))?”). And another one is to decide SAT for general 
bihitting clause-sets (as can be easily seen, and is discussed in [|g[ Subsection 4.3]; 
directly related to the decision “Tr(G) = G' ?”). 

In ||^, Theorem 8.14] (the first 6 sections are covered by |n], Q) the character- 
isation of Q (the intersecting G G AfA/"C^ H=0 ) is translated into this language. 



1.6.3 Autarkies for hypergraphs 



We discuss here now two autarky systems (recall Subsection 1.5 for a general intro- 
duction), which are especially relevant for hypergraph colouring. 



Balanced autarkies. Balanced autarkies for F G CCS (introduced in |36); ff3[ , 
Subsection 11.11.4] provides an introduction) are partial assignments ip, which in 
every clause of F they touch satisfy as well as falsify at least one literal (that is, 
for C G F with C fl {ip U — ip) / 0 holds G fl tp ^ 0 as well as G fl — ip ^ 0). This is 
a normal autarky system, and thus we basically have all the good property general 
autarkies have. Balanced autarkies are closely related to hypergraph colouring. 
The balanced autarkies for F are precisely the autarkies of F U {— G : G G F}, and 
every autarky for a complementation-invariant clause-set is automatically balanced. 
A clause-set is balanced-satisfiable, i.e., can be satisfied by a balanced autarky, iff 
it is NAE-satisfiable ( “not-all-equal” ; see |82j for basic results). 

Balanced autarkies provide the general autarky form for VCCS (whose elements 
are all trivially satisfiable, and thus unrestricted autarkies are not of interest here) , 
which represents hypergraphs for the 2-colouring problem: an F G VCCS is 2- 
colourable iff it is balanced-satisfiable, and F is minimally non-2-colourable iff it 
is minimally balanced-unsatisfiable. Finally we have balanced lean clause-sets (i.e., 
having no non-trivial balanced autarkies), and this is the appropriate notion of 
“leanness” for hypergraphs, as represented by the class VCCS ; more precisely, a 
hypergraph G is lean iff for an isomorphic F G VCCS (isomorphic as hypergraph) 
we have that F is balanced lean. For lean hypergraphs G we have <5 h(G) > 0, and 
this is properly treated by “balanced linear autarkies” . 



Balanced linear autarkies. The special case of “balanced linear autarkies” was 
introduced in s Section 6]; these are the simple linear autarkies for F U {—C : 
C G F} (recall Subsection [h|) .^] Equivalently, the balanced linear autarkies ip for 

6 ^More precisely one should speak of “balanced simple linear autarkies”, but for convenience 
“simple” is dropped. We note that “balanced linear autarkies” are balanced and linear autarkies, 
but in general a balanced and linear autarky need not be a balanced linear autarky, and one should 
speak of “balanced-linear autarkies”; again this is an abuse of language, motivated by the fact 
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F £ CCS are obtained from solutions x £ R"( F ) of M(F ) • x = 0, by translating 
the values Xi as discussed before (it is an easy exercise to see that this yields indeed 
balanced autarkies) . We have a non- trivial balanced linear autarky iff M (F) ■ x = 0 
has a non-trivial solution, and so, in other words, F is balanced linearly lean iff 
the columns of M(F) are linearly independent (iff rank(M(F)) = n(F)). Thus if 
F £ CCS is balanced linearly-lean, then 6(F) > 0 holds; furthermore, as shown in 
|55| , Lemma 7.2], there is then a matching in the clause- variable graph covering all 
variable nodes, and thus even S*(F) = 6(F) holds. By noting that F £ CCS is 
balanced linearly lean iff FU {— C : C £ F} is linearly lean, and considering VCCS, 
we obtain that for lean hypergraphs G (especially, minimally non-2-colourable) we 
have 6h(G) > 0. To say the argument again explicitly: Consider a hypergraph 
G £ VCCS ; then G (as a clause-set) is balanced linearly lean iff the variable-clause 
matrix has linearly independent rows, iff ^(G) is linearly lean (again, as a clause- 
set), which is implied by ^(G) being minimally unsatisfiable (or weaker, being 
lean), which in turn is equivalent to G (as a hypergraph) being minimally- non-2- 
colourable. This conclusion “The rows of the incidence matrix [our variable-clause 
matrix] of a minimally-non-2-colourable hypergraph are linearly independent over 
R.” is shown in |x|; see |l2|. Lemma 4.7] for this and related results, while the 
conclusion “6h(G) > 0” is discussed as Principle 2.1 i n p2| . For properties of 
minimally balanced linearly unsatisfiable clause-sets see 1 56 , Section 4]. 



Fundamental inequalities. We have yet seen two fundamental inequalities, 
namely 6(F) > 1 for F £ MCSAAf, as first shown in |J for minimally unsatisfiable 
clause-sets, and 6(F) > 0 for balanced linearly lean clause-sets, first shown in Q 
(as <5h(G) > 0 for minimally non-2-colourable hypergraphs) .[[^Autarky theory shows 
the general structure of the arguments: We find “obstructions” , which prevent these 
bounds from holding, where such obstructions are given by a subset F' C F where 
there is a partial assignment ip with <p * F' = T, while var (ip) (~l var(F' \ F) = 0. 
Now minimally unsatisfiable F do not have such F' , and thus the envisaged bound 
holds for them, and this is the argumentation in e.g. 90 [|. 

But one can go beyond this, exploiting autarky reduction. Note that ip is pre- 
cisely an autarky, and furthermore possibly one of a special structure. If we just 
look at general autarkies, then we obtain the first generalisation, to lean clause-sets 
or balanced lean clause-sets (covering the hypergraph cases). However often, due 
to the special structure, these special autarkies can be found in polynomial time, 
and their application yields some F' C F, such that the bound holds for F' (while 
for F £ MU we just have F' = F). If we have even an “autarky system”, then F' 
is uniquely determined, that is, does not rely on the choice of the autarkies in the 
reduction process. The case of main importance for this report is 5(F) > 1, where 
the autarkies are matching autarkies, and the reduced F' is the matching-lean ker- 
nel of F, while those F with F' = F are precisely the F £ M.CSAN . On the 
other hand, for hypergraph colouring the fundamental fact is 6(F) > 0 for balanced 
linearly-lean clause-sets, where the autarkies are balanced linear autarkies, and the 
reduced F' is the balanced-lincarly-lean kernel of F. In fact, via autarky reduction 
we obtain a general method to study decompositions, which we will discuss in the 
context of “QMA” . 



1.6.4 Qualitative matrix analysis (QMA) 

QMA can be understood as the analysis of matrices M over the real numbers in 
abstraction of the absolute value of the entries, but only their signs count, that is, 



that linear autarkies which are also balanced are apparently too general a concept to be useful. 
_pAn application yielding Fisher’s inequality (design theory) is discussed in Subsection 7.4 of 
fed] (while Seymour’s inequality is discussed there in Subsection 8.2). 
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one considers the qualitative class Q(M ), which consists of all matrices with the 
same dimensions as M, which have entry- wise the same signs as M (positive, zero, 
negative), and investigates when a property of M holds for all M' £ Q.(M). For 
example, a matrix M, such that all M' £ Q(M) have linearly independent rows, is 
called an L-matrix. The monography || is an excellent source on QMA until the 
1990’s, while a more recent overview is given in f}(i| . 

Starting from [^h|, which exploits Farkas’ lemma to understand (un)satisfiability, 
the connections to QMA have been first explored in Sections 3 and 5 i n |5l|| ; see ff3| , 
Subsection 11.12.1] for a more substantial introduction. It is shown in [^Jllemark 5 
in Section 5], that L-matrices correspond (nearly) precisely (up to transposition and 
handling of zero-rows/columns and repeated rows/columns) to balanced lean clause- 
sets, while lean clause-sets correspond (nearly) precisely to so-called L + -matrices 
(as investigated in |7(J). The square L-matrices are called SNS-matrices ; SNS- 
matrices are at the heart of the poly-time decision for MJ\TC^ a _ n (recall Subsection 



11.12.2] for an overview. 

Further in the translation of terms, now regarding unsatisfiability: unsatisfiable 
clause-sets correspond to sign-central matrices , minimally unsatisfiable clause-sets 
correspond to minimally sign-central matrices. So |)J Theorem 5.4.3] is yet another 
proof of 6(F) > 1 for F £ MU. The variable-degree, as studied in the current 
report, corresponds to the number of non-zero entries in the rows of the matrices 
(while the deficiency is the difference of the number of columns and the number of 
rows). The elements of MUg = i correspond to S -matrices, the elements of SMUg=i 
correspond to maximal S -matrices. 

As mentioned, autarky systems A (like balanced autarkies, matching autarkies, 
etc.) also yield a framework for decomposition theorems. The basic decomposition 
is into A-lean and A-satisfiable sub-clause-sets, as given in |3: Theorem 8.5] (for 
normal autarky systems), which corresponds to a certain unique decomposition of 
the clause-variable matrix into a triangular shape with two blocks on the diagonal, 
and generalises various matrix decompositions in QMA, as discussed in Jsl], Foot- 
note 7, Page 246]. A-lean clause-sets itself can be further decomposed, and the 
main result is ]^6|, Lemma 6] (reviewed in (hi|, Subsection 11.11.5]), generalising U 
Theorem 2.2.5], stating that a clause-set F £ CCS is minimally A-unsatisfiable iff 
F is barely A-lean (it is lean, but removal of any single clause destroys this) and 
A-indecomposable (no triangular decomposition into A-lean blocks is possible for 
the clause- variable matrix). 

1.6.5 Biclique partitions of (multi-)graphs, and algebraic graph theory 

We finish this overview on related themes in combinatorics by a field of graph theory, 
which, like QMA, can be understood as a study of clause-sets from a special angle, 
focusing on the conflict-structure of clauses. 

Certain aspects of algebraic graph theory. The starting point is |3^], where 
the problem of “addressing a graph” was introduced. One considers a symmetric 
matrix D of dimension m £ N over No, with a zero-diagonal, where the entries are 
interpreted as “distances” (in J33| the D it j are the distances between the nodes of 
some graph), and asks for the smallest N £ No, such that there are m codewords 
Ci,-.., Cm G {0,1,*}^ with the property, that the modified Hamming distance 
between c* and Cj, which simply ignores positions with *, is Dij. 

From our point of view, such a codeword is nothing else than a clause over the 
variables 1, . . . , N, while the modified Hamming distance is the number of clashes 
(conflicts). So the question is about the existence of clauses C* for i £ {1, . . . , to} 



1.6.1 ) , and the connections to autarky theory are explored in |56| ; see Subsection 
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over variables 1, ... ,7V, such that j = |C, H —C 3 | for i,j £ {1, . . . ,m}. How- 
ever yet “clause-sets” are not perceived/known as combinatorial objects, and their 
perspective is missing from the literature. See Chapter 9 of J95|, Chapter 9] for an 
introduction. A basic result of fid] is that if D has all entries outside the diagonal 
equal to 1, then N = m — 1 (see also [[52], Lemma 6.6] for a discussion in the con- 
text of eigenvalue methods; for a direct combinatorial proof see |j(|). This follows 
from the general result N > max(n+(D), n_ (D)) of |l3| (the “Lemma of Witsen- 
hausen”), where n+(D) resp. ri-(D) is the number of positive/negative eigenvalues 
of D. For the general case in |)8) it is shown, that if the distances j are indeed 
the distances between the nodes of some graph, then we have N < m — 1. 

Actually, the Lemma of Witsenhausen works for arbitrary matrices D over No 
with zero diagonal. Taking up the clause-set perspective again, the minimal number 
N of variables in a clause-set representing D (it is an easy exercise to see that N is 
finite, i.e., a representation is always possible) is equal to the minimal number of 0, 1- 
matrices of rank 1 which add up to D: A variable contributes precisely a “rectangle” , 
i.e., a matrix which is 1 at the entries I x J for some 0 ^ /, J C {1, . . . , m}, and 
otherwise 0, and these are precisely the 0, 1-matrices of rank 1. Considering D as the 
adjacency matrix A = D of some multigraph (where parallel edges are allowed), we 
see that N is also equal to the minimum number of bicliques into which the edge-set 
of that multigraph can be partitioned, and N is therefore denoted by bcp(A) £ No 
(the “biclique partition number” of A resp. the corresponding multigraph) f^] 

The notion “hermitian rank” has been introduced and studied in |35|] for arbi- 
trary hermitian matrices A (square matrices with complex numbers as entries, such 
that transposing the matrix and taking the complex conjugate of each entry yields 
back the original matrix), denoted by h{A) := max(n+(A), n_(A)) £ No- So the 
Lemma of Witsenhausen takes the form, that for symmetric matrices A over No 
with zero diagonal holds bcp(A) > h{A). 



Conflict analysis. The essential observation is now that we can go back and forth 
between biclique partitions of multigraphs and clause-sets. In one direction we can 
understand clause-sets F as representations of biclique partitions of multigraphs, 
where for each vertex we get a clause, and from each biclique we obtain a variable, 
where the two sides of the biclique are the positive and negative occurrences of the 
variable. So we can understand a multigraph together with a biclique partition as a 
clause-set, and we can use tools from clause-set-logic to analyse the pair multigraph 
with biclique-partition. The deficiency then becomes the difference between the 
number of nodes and the number of bicliques. Satisfiability means that it is possible 
to select from each biclique one side such that all vertices are covered. 

In the other direction we can understand biclique partitions of multigraphs (or, 
equivalently, representing a matrix A as above as a sum of rank-1 matrices over 
{0, 1}) as representations of clause-sets F , namely the nodes of the conflict multi- 
graph cmg(F) are given by the clauses, while the edges are the conflicts (clashing 
literal occurrences x, —x), and the bicliques are given by the variables (their positive 
and negative occurrences) . In this way we can analyse the influence of the “conflict 
structure” on properties of clause-sets; the basic notions, as introduced in 1 54 with 
underlying report |53|], are as follows. 

For F £ CCS let CM (F) (the conflict matrix) be the square matrix of dimension 
c(F) over No, with entries | C (~l —D\ for C,D £ F (thus with zero diagonal), i.e., 
CM(F') is the adjacency matrix of cmg(F). So we can use the hermitian rank as a 
measure h : CCS —> No (as first done in (53, Subsection 3.2]), namely 



h(F) := /i(CM(F)); 



8 )ln M we used u n s (A) instead, the “symmetric conflict number”. 
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see Points 1, 3 in [^6[ Section 2] for various equivalent characterisations |^] By 
definition we have bcp(F) := bcp(CM(F)) < n(F), and thus h(F) < n(F). Since 
for a principal submatrix A' of a hermitian matrix A holds h(A') < h(A) (this 
follows by “interlacing”; see |n], Theorem 9.1.1]), we get h(tp * F) < h(F) for all 
partial assignments p, and also h(F') < h(F) for all F' C F, which gives motivation 
to consider h(F) as a complexity measure for F £ CCS. 

In Q also the hermitian defect dh : CCS — v No has been introduced as 

6 h (F) := c(F) - h(F), 



and thus 5(F) < 5h(F)-, see Point 2 in [^6], Section 2] for a geometric characterisation 
(as the “Witt index” of the quadratic form associated with CM(F)). Actually 
5 *(F) < 5h(F) holds and even stronger properties (see |54|, Subsection 3.3]). An 
important property is (again) 5h(p * F) < 5h(F) for all F £ CCS and partial 
assignments ip together with Sh(F') < 5h(F) for F' C F, by Q, Corollary 9], and so 
we might consider the hermitian defect as a stabilised version of the maximal defect 
(both are also complexity measures; recall that we have fixed-parameter tractable 
SAT decision for input F £ CCS in the parameter 5*(F)). Note that in general 
we can have 5*(p * F) > 5*(F), for example F := {{1}} has 5*(F ) = 5(F) = 0, 
while for F' := (1 — > 0) we get F' = {T}, and thus 5*(F') = 5(F') = 1. See 
fit] . Subsection 3.3] and |57], Subsection 11.2] for more information on 5*(ip * F); 
splitting on a single(!) variable is very important for this report, with the basic fact 
5*((x — t 1 ) * F) < 5(F) for F £ A iU and any literal x. 

The first direct applications applied 5(F) < <5 h (F) for F £ CCS , namely that 
for a hitting clause-set F £ HIT (equivalently, all entries of CM(E) outside the 
diagonal are non-zero) with a regular conflict multigraph (i.e., all entries of CM(F) 
outside the diagonal are equal) we have 5\,(F) < 1, and thus 5(F) < 1 (|54|, Theorem 
33]) 



PT 



We get that SMUs^i = UHITs=\ is (precisely) the class of unsatisfiable 
hitting clause-sets with regular conflict multigraph ( |54|, Corollary 34] ; a combinato- 
rial proof of this was independently found in fjT[ Lemma 11]), and is also (precisely) 
the class of unsatisfiable clause-sets F with 5y 1 (F) < 1 (|54|, Theorem 26]). 

A clause-set F £ CCS is called exact (|54|, Subsection 3.4]) if bcp(F) = n(F), 
that is, F is optimal in realising cmg (F) with respect to the number of variables. De- 
ciding exactness is coNP-complete, while the special class of eigensharp clause-sets , 
defined by h(F) = n(F ), or, equivalently, 5h(F) = 5(F), is decidable in polynomial 
time. With |54], Theorem 14] every eigensharp clause-set is matching lean. This 
leads to [^, Conjecture 16], “Every exact clause-set, whose conflict-matrix is the 
distance matrix of some connected graph, is matching lean.” , which generalises the 
already mentioned main result of ]98|| (the proof of the “squashed cube conjecture”). 

As already mentioned, we consider h(F) for F £ CCS as some form of complexity 
measure, measuring the complexity of representing the conflicts of F via simple 
matrices. In J2(| polytime SAT decision in case h(F) < 1 was shown, while the 
cases h(F) < k for fixed k > 2 are open; an interesting stepping stone is to show 
polytime SAT decision for F £ CCS with bcp(F) < k (recall CCSh cp <k Q CCSh<k)- 
The notion of blocked clauses, a special type of clauses which can be removed sat- 
equivalently, introduced in Q, is important here, and |2(], Theorem 3] shows, 
that from F £ CCSh< 1 after elimination of all blocked clauses (which yields a 
unique result) we obtain F' C F with bcp(F') < 1. We recall from Subsection 
1.6.2| , that SAT-decision for F' is now a special case of the Transversal Hypergraph 
Problem, namely, as shown in ]2(| Lemma 11], the problem is exactly the Exact 
Transversal Hypergraph Problem , where every transversal must be “exact” , that is, 
must intersect every hyperedge in exactly one vertex; this problem is decidable in 



9 )For Point 1(c) there it must be a “diagonal matrix A'”. 

10 dn M unfortunately the term “uniform” was (mis)used instead of “regular”. 
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polynomial time by |2^], and thus we get SAT-decision for CCSh<i in polynomial 
time. The characterisation of F G MU with bcp (F) < 1 is an open problem (while 
we have polytime membership decision for MUb cp <i), and by |26|, Conjecture 16] 
they would have a very simple structure. 

We conclude by mentioning that in [^, Section 6] the basic facts are gener- 
alised to non-boolean clause-sets, and that by extending the reduction of multi- 
clique partitions to biclique partitions from |yj a new and interesting translation 
from non-boolean to boolean clause-sets was obtained. 



1.7 Overview on results 



Sections ^]to ^provide foundations for the main results in the later sections. In Sec- 
tion |] basic notions and concepts regarding clause-sets and autarkies are reviewed. 
In Section we discuss minimal unsatisfiability, with some auxiliary results on satu- 
ration (adding literal occurrences to clauses, to make minimal unsatisfiability robust 
against splitting) and splitting. Section [| reviews the concept of “variable-minimal 
unsatisfiability”, as introduced in |ll|, i.e. , the class MU C VMU C US AT- There 
are mistakes in this paper, and we rectify them here: 

• we show that VMU C LEAN holds (Lemma [0|) ; 

• we provide a corrected characterisation of VMU (Lemma [D]); 



• and we give a c orrected proof of polytime decision of VMUs=k for fixed k 
in Theorem 4.7, where we also obtain the stronger result, that decision of 
VMU (i.e., deciding whether for input F G CCS holds F G VMU or not) is 
fixed-parameter tractable in the deficiency 5(F). 



Section is then concerned with singular variables, eliminating them via singular 
DP-reduction, and creating them via “singular extensions” . An important auxiliary 
result is Lemma [5.4| , showing that eliminating singular variables is harmless for 
bounds on the minimum variable-degree; we also show various auxiliary results 
on unit-clauses in minimally unsatisfiable clause-sets. This block of preparatory 
sections is concluded by Section |] on full subsumption re solut ion, an ubiquitous 
reduction (and extension); as an application, in Theorem |6. 13 we can determine 
precisely the possible n(F) and c(F) for F G MUs=k- 

The first main results (but still on the preparation side) one finds in Section 
which introduces the numbers nM(fc) G N and proves exact formulas and sharp 
lower and upper bounds; the point here is that the introduction of nM(fc) happens 
via a recursion which is tailor-made for our application in Section |8j, but which 
makes it somewhat difficul t to determine the n umbe rs in a global way. A main 
technical result is Theorem 7.15 , while Theorem 7.21 proves the general formula. 

In Section || then we find a basi c ce ntral result of this report, the upper bound 
/ ivd(MUs=k ) < nM(fc) (Theorem |g.3|) . Section || is concerned with generalising 
this upper bound. An interesting auxiliary class S£V C C CS , clause-sets where 
deficiency and surp lus coincide, is introduced in Subsection 9T; the main lemma 
here is Lemma 9.5, w hich shows that unsatisfiable elements of S£V are in fact in 
VMU. In Subsection |9.2| the upper bound for MU then is lifted to lean clause-sets 
in Theorem |9 . and also sharpened via replacing the deficiency 5 by the surplus a. 



Theorem 9.12 shows that the upper bound is sharp for any class between VMU n 
S£V and C£AN . 



Section 10 concerns algorithmic applications. A corollary of Theorem 9.8 



is, 



that if the asserted upper bound on the minimum variable degree is not fulfilled, 
then a non-trivial autarky must exist (Lemma 10.1). Since the variable-set of such 
a non-trivial autarky is polytime computable, we show in Theorem 10.2 that we can 
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indeed establish the upper bound shown for lean clause-sets also for general clause- 
sets, after a polytime autarky-reduction. In Subsection 10.2 then the problem of 
finding such autarky (that is, finding the assignment) is discussed, with Conjecture 
10. 3 making precise our believe that one can find such autarkies efficiently. Theorem 
10. £ pinpoints the “critical” class MCC1Z C SAT , which is polytime decidable, 



and where we know that these clause-sets are satishable, but we even don’t know 
how to find any non-trivial autarky efficiently. This block on generalisations of 
the min-var-degree upper bound is concluded by Section [0], where we discuss the 
possibilities to generalise it to matching-lean clause-sets (where only the absence of 
special (non-trivial) autarkies is guaranteed). 

In Section [[U we then turn to the study of the numbers pnM(fc) := /j,vd (MUg=k), 
looking now for improved upper bounds and matching lower bounds. We present 
two infinite classes of deficiencies k with /xnM(fc) = nM(/c), and present a general 
method of obtaining lower bounds for /xnM(fc), via counting full clauses (clauses 
containing all variables — these clause are strong structural anchors). In Section H 
we introduce a general recursive method to obtain upper bounds like nM(fc), via the 
“non-Mersenne operator” NM(/), which takes a “valid bounds fu nction” /, that is, 
some partial information on /inM(/c), and improves it (Definition |l3.12 ). Theorem 
1 3 . 1 0| shows that this inde ed yiel ds a valid method for improving upper bounds on 
/rnM (k), while in Theorem 13.15| we demonstrate how this method recovers nM(fc), 
by just starting with the information /jiiM ( 1 ) = 2. In Section [hi] we harvest (first) 
fruits of these methods. First in Theorem 14.1 we show /inM(fc) = nM(fc) for k < 5. 
Then in Theorem 14.3 we prove fi nM(6) = nM(/c) — 1 (using a variety of structural 
results on MU provided in this report). Plugging this information on /xnM into 
our machinery, we obtain the improved upper bound /inM < nMj in Theorem 14.5, 
while in Theorem 14.5 we determine nMi(fc) numerically. 

Finally, in Section 15| open problems are stated, thoroughly discussing research 
perspectives, including nine conjectures. Subsection 15.2 discusses improved up- 
per bounds for yunM(/c) from the forthcoming work |68|| . Subse ction 15.3 is about 
improved lower bounds, via counting full clauses. In Lemma 15.2 we cite from 
the work in progress foil (to be completed soon), which provides improved lower 
bounds via the “Smarandache primitive function” S- 2 (k), yielding the first-order 
asymptotic determination of /inM(fc) ~ k (Corollary 15.4), where now the open 



question is about the asymptotic determination of /inM(fc) — k. In Subsection 15.4 
we discuss generalisations to no n-boolean clause-sets. 

The central Conjecture |l5^ of the project of “understanding MU” , on the fi nitely 
many “characteristic patterns” for eac h MU s=k, is discussed in Subsection |l5.5 



An important special case is Conjecture 15.7 (now a fully precise statement), about 
the classification of unsatisfiable hitting clause-sets (or “disjoint /orthogonal tau- 
tologies” in the terminology of DNFs). In Lemma 15. 9| we show how two of the 
conjectures together yield computability of nM(fc). 

This report is a substantial extension of the conference paper |bi|: Section 3 
there has been extended to Section [l] here, with considerable more details and 
examples on non-Mersenne numbers. Section 4 there is covered by Sections II 
and 10, with various additional results (for example showing sharpness of the upper 
bound for CSAAf). And the results for Section 5 there are contained in Subsection 
PH here. All other sections in this report are new. 



2 Preliminaries 



We follow the general notations and definitions as outlined in |43| , where also further 
background on autarkies and minimal unsatisfiability can be found. We use N = 
{1,2, . . .} and No = NU {0}. For the binary logarithm we use ld(a;) := log 2 (a;) 
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( “logarithm dualis” ) . 

We apply standard set-theoretic concepts, like that of a map as a set of pairs, 
and standard set-theoretic notations, like f(S) = {f(x) : x £ S} for maps / and 
S C dom(/), and “C” for the strict subset-relation. We use also the less- known 
notation “ A kJ B" for union in case A , B are disjoint, that is, A kJ B := All B is only 
defined for AC\B = 0. For maps f, g with the same domain X we use / < g :<£=>■ V x £ 
X : /( x) < g(x) (i.e., pointwise comparison), while / < : f(x) < g{x). 

2.1 Clause-sets 

The basic structure is a set CIT, the elements called “literals”, together with a 
fixed-point free involution called “complementation” , written x £ CIT *-^x£ CIT ; 
so the laws are x ^ x and x = x for x, y £ CIT . We assume Z \ {0} C CIT, with 
x = —x for x £ Z \ {0}. For a set L of literals we define L := {x : x £ L}. 
Furthermore a set N C VA C CIT , the elements called “variables”, is given, with 
CIT = VA kJ VA. Variables are also called “positive literals” , while literals v for 
v £ VA are called “negative literals” . The “underlying variable” of a literal is given 
by the operation var : CIT — > VA (“forgetting complementation”), with var(-u) := v 
and var(u) := v for v £ VA. 

Example 2.1 We can thus write e.g. 1, 6 for two (different) variables, and 1, 5, —1 
for three (different) literals. In examples we will also use v, w, a, b , c and such letters 
for variables (as it is customary), and accordingly v etc. for literals, and in this 
context (only) it is then understood that these variables are pairwise different. So 
{v,w,x, x}, when given in an example (without further specification), denotes a set 
of literals with |{i>, w, x,x}\ = 4 and |{v, w, x,x} D VA\ — 3. 

Without restriction we could assume CIT = Z \ {0} (as we did in the Introduc- 
tion), but it is often convenient to use arbitrary mathematical objects as variables. 
All our objects build from literals are finite, and thus, because of the infinite supply 
of variables, there will always be “new variables” (that’s the mathematical point of 
having natural numbers as variables — we won’t use the arithmetical structure) . 

A clause C is a finite and clash-free set of literals (i.e., CC\C = 0), the set of all 
clauses is CC. A clause-set is a finite set of clauses, the set of all clause-sets is CCS. 
The simplest clause is the empty clause J_ := 0 £ CC, the simplest clause-set is the 
empty clause-set T := 0 £ CCS. The set of all hitting clause-sets is denoted by 
HIT C CCS, those F £ CCS such that two different clauses C,D £ F , C ^ D, 
have at least one clash, i.e., Cn D ^ 0. In the language of DNF, hitting clause-sets 
are known as “orthogonal” or “disjoint” DNF’s; see ||T3[ Chapter 7]. 

Example 2.2 We have e.g. (1, 2, —3} £ CC, while { — 1, 1} ^ CC. The only clause- 
set in HIT containing the empty clause is {_L} £ HIT ■ An example of a non-hitting 
clause-set is {{1, 2}, { — 1, 2}, {3}} £ CCS\HIT, where we obtain an element of 
HIT if we add literal —2 to the third clause. 

We use var(F) := Ucg.F var (^) f° r the set of variables of F £ CCS, where 
var(C) := {var(ir) : x £ C} is the set of variables of cla use C £ CC. The possible 
literals for a clause-set F are given by lit(E) := var(E) U var(F), while the actually 
occurring literals are just given by (J F (the union of the clauses of F). A literal x 
is pure for F if x £ IJ F. For a clause-set F we use the following measurements: 

• n(F) := |var(E)| £ No is the number of variables, 

• c(F) := |F| £ No is the number of clauses, 
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• 6(F) := c(F) — n(F) £ Z is the deficiency (the difference of the number of 
clauses and the number of variables), 

• £{F) := EcgfI^I ^ No is the number of literal occurrences. 

We call a clause C full for a clause-set F if C £ F and var(C) = var(F), while 
a clause-set F is called full if every clause is full. For a finite set V of variables let 

A(V) := {C £ CC : var(C') = V} £ CCS. 

Obviously A(V) £ HIT is the set of all 2 1 1 1 full clauses over V, and F £ CCS is 
full iff F C A(var(F)). We use A n := A({1, . . . ,n}) for n £ No- Dually, a variable 
v £ VA is called full for a clause-set F if for all C £ F holds v £ var(C). A 
clause-set is full iff every v £ var(F) is full. 

Example 2.3 For F := {_L, {1}, { — 1, 2}} we have: 

1. var(F) = {1, 2}, lit(F) = {-1, 1, -2, 2},(jF = {-1, 1, 2}. 

2. Literal 2 is pure for F (the other literals in lit(F) are not pure). 

3. n(F) = 2, c(F) = 3, 6(F) = l, t(F) = 3. 

4- {^1,2} is a full clause of F, while the two other clauses are not full. 

5. F has no full variable, while F \ {_L} has the (single) full variable 1. 

The standard “complete” full clause-sets are Aq = {_!_}, Ai = {{—1}, {1}}, 

A 2 = {{-1,-2}, {-1,2}, {1,-2}, {1,2}}, 



and so on. 

We often define a class of clause-sets via some measure p as follows: 

Definition 2.4 Consider a class C C CCS and a measure fi : CCS — >• R. For a £ R 
we use Cfj, =a '■= {F £ C : n(F) = a}, and similarly we use C^a and analogous 
notations. 

When we use the form “C M n 0 ”, then p stands for a measure (e.g., p = 5 or p = n). 
Example 2.5 CCS n =o = CCSg-o = {T, {_L}} , CCS c —o = {T}, and CCS n< o = 0. 

2.2 Semantics 

A partial assignment is a map ip : V — {0, 1} for some finite (possibly empty) 
set V C VA of variables, where var(<^) := V and lit(y>) := var(iyj) U var(tp). The set 
of all partial assignments is denoted by TASS. For a literal x £ lit(y>) we also define 
p(x) £ {0, 1}, via p(v) := 1 — p(v) for v £ var(y>). Via a small abuse of language 
we define <p _1 (e) := {x £ lit(<p) : p(x) = e} £ CC for e £ {0,1}. Special partial 
assignments are the empty partial assignment () := 0, and for literals x £ CIT and 
£ £ {0, 1} the partial assignment (x — >■ e) £ TASS, with var((a; —> e)) = {var(a:)} 
and (x -> e)(x) = e. 

The application of a partial assignment p £ TASS to a clause-set F is denoted 
by p * F, which yields the clause-set obtained from F by removing all satisfied 
clauses (which have at least one literal set to 1), and removing all falsified literals 
from the remaining clauses: 

<p * F := {C \ <p _1 (0) : C £ F A C n v? -1 (l) = 0} G CCS. 
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This definition is motivated by the default interpretation of a clause-set as a “con- 
junctive normal form” (CNF), where a clause is understood as a disjunction of 
literals (thus is satisfied iff at least one literal in it is satisfied), while a clause- 
set is understood as a conjunction of its clauses (thus is satisfied iff all clauses 
are satisfied). A clause-set F is satisfiable iff there is a partial assignment tp with 
tp* F = T, otherwise F is unsatisfiable. The set of satisfiable clause-sets is denoted 
by SAT C CCS , while US AT := CCS \ SAT denotes the set of all unsatisfiable 
clause-sets. 



Example 2.6 If F £ US AT and for F' € CCS holds F C F' , then also F' £ 
USAT (satisfying a clause-sets gets harder the more clauses there are). 

By definition we have tp * F = T iff\/D£F : 9J _1 (1) flfi / 0; thus F G SAT 
iff there is a clause C G CC with C n D ^ 0 for all D G F. (We could write 
“C D D ^ -L ” here, but it appears somewhat more natural to use “0 ” here.) 

The unsatisfiable hitting clause-sets are denoted by UHAT := USAT C HAT. 



Example 2.7 T G SAT C\HAT and {_L} £UHAT ■ In general a full clause-set F 
is unsatisfiable iff F = A(v&t(F))), and thus A(V) G UHAT for all finite V C VA. 

The fundamental property for F G HAT: Consider tp, tp G TASS , such that there 
are C,D £ F, C ^ D, with tp * {C} = tp * { D } = {_L} (that is, 1 G tp * F fl tp * F , 
where there are different falsified clauses for these two partial assignments). Then 
tp,tp are incompatible, i.e., there is v G var (tp) n var(^) with tp(v) ^ ip(v). 

It follows easily that for F G HAT holds F G USAT Y]r.a f = 1. 

A nice exercise is to show UHATs<o = 0 (in Section 2.t a more general result 
is stated). 

Finally, the semantical implication F \= C for F G CCS and clauses C G CC 
holds iff M tp G TASS : tp* F = T =>■ tp * {C} = T. We have F G USAT F |= _L. 



2.3 Resolution 

Two clauses C,D G CC are resolvable if \C CD\ = 1, i.e., they clash in exactly one 
variable (called the resolution variable var(x), while x is called the resolution literal). 
For two resolvable clauses C and D, the resolvent CoD := (C U D) \ {x,x} G CC 
for C fl D = {x} is the union of the two clauses minus the resolution literal and its 
complement. As it is well-known (the earliest source is § 0 ) , a clause-set F is 
unsatisfiable iff via resolution (i.e., closing F under addition of resolvents) we can 
derive _L, and, more generally, we have F |= C iff from F via resolution a clause 
C' C. C is derivable. 

An important reduction for clause-sets F G CCS and variables v G VA, resulting 
in a clause-set satisfiability-equivalent to F (satisfiable iff F is; sometimes called 
“equi-satisfiable” ) and with variable v eliminated, is DP-reduction 

DP„(E) := {C G F : v var(C)} U {CoD : C, D £ F /\ C HD = {v}} £ CCS 

(also called “variable elimination”), obtained from F by removing all clauses con- 
taining variable v from F, and replacing them by their resolvents on v. See [|65| for a 
fundamental study of DP-reduction. The satisfying assignments tp of DP„(F) (i.e., 
<p*DP„(E) = T) with var (tp) = var(E)\{u} are precisely the satisfying assignments 
tp of F with var (tp) = var (F), when restricting tp to var(E) \ {u}. Logically, DP„(F) 
is equivalent to 3 v : F , the existential quantification of v for F (but we do not use 
quantifiers in this report, so this remark might be ignored here). 
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2.4 Multi-clause-sets and restrictions 



These notions are generalised to multi-clause-sets, which are maps F : CC — >• No, 
such that the underlying set of clauses {C G CL : F(C) T 0} is finite, and so we 
speak of the underlying clause-set; the set of all multi-clause-sets is denoted by 
CCS := {F G Nq £ : CC \ F -1 (0) is finite}.^ Clause-sets are implicitly promoted 
to multi-clause-sets, if needed, by using their characteristic functions, and multi- 
clause-sets are implicitly cast down, if needed, to clause-sets by considering the 
underlying clause-set; “if needed” refers to operations which either require multi- 
clause-sets or clause-sets. If however we want to make explicit these operations, we 
use els : CCS — t CCS (with cls(F) := CC \ F” 1 ( 0)) and els : CCS — > CCS (with 
cls(-F)(C) := 1 if C G F, and cls(F)(C) := 0 otherwise). For F G CCS we extend 
the basic operations in the obvious way: 

• var(F) := var(cls(i ? )), lit(F) := lit(cls(F)), IJ F := (Jcls(F). 



• n(F) := n(c\s{F)) G N 0 , c(F) := J2ceCC F ( G ) e N o, S(F) := c(F ) - n(F) G 
Z, e(F) := 

SceF F(C) ■ \C\ G N 0 . 

The application of partial assignments <p G VASS to a nrulti-clause-set F G CCS 
yields a mute-clause-set <p * F G CCS , where the multiplicity of a clause C G CC in 
ip * F is the sum of all multiplicities of clauses D G F (i.e., D G cls(F)) which are 
shortened to C by tp: 



(‘f * F)(C) := J2 F ( D )' 

DeF,Dr\ V >~ 1 ( 1)=0, D\ip~ 1 ( o)=c 



Example 2.8 If ip is a total assignment for F (assigns all variables of F, that is, 
var(y>) = var(F) ), then tp * F is {m * _L} ; denoting the multiplicity of a clause by a 
(formal) factor, with m = YlceF cn^- 1 (i)=0 F ( F ‘) (so m = 0 O tp * F = T ). 

For us the clause-sets are the objects of interests, while multi-clause-sets are only 
aux iliary devices, created by the operation of “restriction” defined next (Definition 
pTg| ). However we have to take care of the details, and thus together with introducing 
a class C C CCS we also introduce the corresponding class C C CCS of multi-clause- 
sets (using the generalised definition of C), where we must discuss the relation. To 
start with, the classes SAT and US AT are invariant under multiplicities, that 
is, a multi-clause-set is in it iff the underlying clause-set is in the underlying class of 
clause-sets ( SAT resp. US AT). The other extreme we have with the class IlIT of 
multi-hitting-clause-sets, which disallows multiplicities, that is, all multiplicities 
must be 1 (since clauses can not clash with themselves, by definition of clauses), 
and thus up to the canonical identification the classes T-LIT and TIT are identical. 

An important operation with nrulti-clause-set is the “restriction” to a set of 
variables (see Subsection 3.5 in |57j for more information): 

Definition 2.9 For a set V C V14 of variables and a multi- clause- set F G CCS by 
-F[V] G CCS the restriction of F to V is denoted, which is the multi- clause- set 
obtained by removing clauses from F which have no variables in common with V , 
and removing from the remaining clauses all literals where the underlying variable 

is not inV : 

F[V}(C) := Y, F ^- 

DeF, var (D)r\V^H, Dr\{V UV)=C 

Here it is essential that F\V\ is a multi-clause-set, and if previously unequal clauses 
become equal, then accordingly the multiplicity is increased. 

11 lln earlier papers we used “A4CCS” instead of “ CCS ” . 
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Example 2.10 {{a}, {a, b}, {£»}, {a, 5}}[{a}] = {2 * {a}, {a}}. 

Simple properties of this operation are (for multi-clause-sets F and V, V' C VA)\ 

1. F[0] = T, F[V] = F \ {_L} for var(E) C V (where F \ F' for a clause-set F' 
means that all occurrences of clauses from F' are removed from F). 

2. (F[V])[E'] = F[V nV']. 

Clause-sets F, G are called isomorphic, if the variables of F can be renamed 
and potentially flipped so that F is turned into G. More precisely, an isomorphism 
a from F to G is a bijection a : lit(F) — >• lit(G) which preservers complementation 
(a(x) = a(x)), and which maps the clauses of F precisely to the clauses of G ; when 
considering multi-clause-sets, then the isomorphism must preserve the multiplicity 
of clauses (that is, G(a(C)) = F(C) for all C G CC). 

2.5 Degrees 

For the number of occurrences of a literal x G CTT in a (multi-) clause- set F G CCS 
we write 

ldH*):= E F ( C ) e No, 

c&F,xec 

called the literal-degree, while the variable-degree of a variable v is defined as 
vdir(u) := kip (v) + ld.p (tl) G No- A (multi-)clause-set F is called variable-regular 
if all variables v G var(F) have the same degree, or, stronger, literal-regular, if all 
literals x G lit(F) have the same degree. A singular variable in a (multi-)clause- 
set F is a variable occurring in one sign only once (i.e., 1 G {ldF(u), ld.p(^)})- A 
(multi- (clause-set is called non-singular if it does not have singular variables. The 
central concept for this report is the degree of a variable with minimal occurrences: 



Definition 2.11 We define the minimum variable degree //vd : CCS -> NU 

{Too} (“min-var- degree” for short) as follows (which also works for multi-clause- 
sets F): 

• For F G CCS with n(F) ^ 0 we let /ivd (F) := min.„ gvar (^) vd^ (u) G N. 

• While for n(F) = 0 we set gvd(F) := Too. 

For a class C C CCS of clause-sets let /xvd(C) G No U {Too} be the supremum of 
/ivd (F) for F G C with n(F) > 0, where we set /xvd(C) := 0 if there is no such F 
(while otherwise we have gvd (C) >1). 

By definition we have gvd (C) < /jvd(C') for CCC'C CCS. 

Example 2.12 For F := {2 * { a , b}, {a, 6}, {b, c}} G CCS we have: 

• ld_p(a) = 2, ldi?(a) = l, ldF(&) = 3, Mf( 6) = 1, ldf(c) = 1, ldf(c) = 0. 

• vd F (a) = 3, vd^(&) = 4, vd^(c) = 1. 

• fxvd(F) = 1. 

Examples for /ivd(C) are: 

• A tvd(0) = 0. 

• pvd (CCS) = Too. 
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• Mvd({T, {_!_}, {{u},{w}},F}) = 2. 



The simplest but relevant class of clause-sets for us is given by the A(V) (the 
unsatisfiable full clause-sets; these are the simplest unsatisfiable clause-sets): 



Lemma 2.13 For n G Nq we have 



1. n(A n ) = n, c(A n ) = 2 n , S(A n ) = 2 n - n. 

2. A n is full and unsatisfiable, and thus A n G WHTTs= 2 n -n ■ 

3. A n is literal-regular (thus variable-regular) , with pvd(A n ) = 2". 



Further prope rties of unsati sfiable ful l clause-sets one finds i n Example 2.2()|, Lemma 
].!; , Corollary 3.11 , Lemmas (19. 6. 1C , Corollaries 6.11 , 6.12 , and Examples 922, 9.10 . 
Properties of satisfiable full clause-sets are found in Example 10.7. 



2.6 Autarkies 

Besides algorithmic considerations, which were always present since the introduction 
of the notion of an “autarky” in J7f| , also a kind of a “combinatorial SAT theory” 
has been developed around this notion of generalised satisfying assignments. A 
general overview is given in [|43| , with recent additions and generalisations in J5?J. 
An autarky (see jl|| Section 11.8]) for a clause-set F G CCS is a partial assignment 
ip G VASS which satisfies every clause C G F it touches, i.e., for all C G F 
with var (<p) n var(C) ^ 0 holds ip * {C} = T ; equivalently, for all C G F holds 
C C p _1 (0) T 0 => Cny) _1 (l) 0. The simplest examples for autarkies are as 

follows: 



Example 2.14 The empty partial assignment () is an autarky for every F G CCS 
(no clause is touched), and more generally all ip G VASS with var (ip) fl var (F) = 0 
are autarkies for F , the trivial autarkies. On the other end of the spectrum every 
satisfying assignment for F (i.e., ip * F = T ) is an autarky for F (every clause is 
satisfied). A literal x G CTT is a pure literal for F iff ( x — » 1) is an autarky for F . 

If p is an autarky for F, then ip* F C F holds, and thus ip * F is satisfiability- 
equivalent to F. Autarkies mark redundancies, and the corresponding notion of 
clause-sets without such redundancies was introduced in ^o), namely a clause-set 
F is lean if there is no non-trivial autarky for F, and the set of all lean clause-sets 
is denoted by CSAM C US AT CJ{T}. The class CSAN of lean multi-clause-sets is 
invariant under multiplicities. 

Example 2.15 Some simple examples: 

1 ■ T , {-L}, {M, {T}}, {{u}, {«}, {w}, {W}} G CSAM. 

2. If F, F' G CSAM, then FUf'G CSAM. 

3. If F G CSAM and F' G CCS with var(F') C var (F), then F U F' G CSAM . 

4 ■ {M,{T},{w}} ^ CSAM. 

A weakening is the notion of a matching-lean clause-set F (introduced in 
13, Section 7]; see Section 11.11] for an overview), which has no non-trivial 
matching autarky, which are special autarkies given by a matching condition: 
for every clause touched, a satisfied literal with unique underlying variable must 
be selectable; the class of all matching-lean clause-sets is denoted by MCSAM D 
CSAM. 
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Example 2.16 A clause-set F £ CCS is matching-lean (F £ MCEAN ) iff for all 
F' C F holds 5(F' ) < 1 5(F) ($% Theorem 7.5]). Thus for every matching-lean 
multi-clause-set F ^ T we have 5(F) > 1 m generalising §)■ 

F {{1, 3}, {2, —3}, {3}, {—3}} has the matching autarky (1 — »• 1, 2 — » 1), while 
for F' := F U {{1, 2}} we have F' £ MCEAN . Note 5(F) = 1 = <5 ({{3} , {—3}}), 
while 5(F') = 2. 

It is decidable in polynomial time whether F £ MCEAN holds (which follows 
for example by the characterisation of MCEAN via the surplus below). The class 
MCE AN D CEAN of matching-lean multi-clause-sets is not invariant under mul- 
tiplicities: For a multi-clause-set F holds cls(F) e MCE AN => F £ MCE AN . but 
not the other way around: 



Example 2.17 {{?;}} ^ MCE AN, but {2 * {v}} £ MCE AN . and more generally 
{{x \, . . . , x n }} ^ MCE AN for n > 1, while {(n + 1) * {x \, . . . , x n }} £ MCE AN . 
Indeed it is easy to see that for every F £ CCS there is F' £ CCS with cls(E') = F 
and F' £ MCE AN . 

The process of applying autarkies as long as possible to a clause-set F £ CCS is 
confluent, yielding the lean kernel of F (the largest lean sub-clause-set of F, that 
is, U{^' CF:F'£ CEAN}; see [|o[ Section 3]). Computation of the lean kernel 
is NP-hard, since the lean kernel of satisfiable clause-sets is T. But the matching- 
lean kernel of F (the largest matching-lean sub-clause-set of F, that is, (J{F' C 
F : F' £ MCEAN}; see S Section 3]), now obtained by applying matching 
autarkies as long as possible, which again is a confluent process, is computable 
in polynomial time. Note that a clause-set F is lean resp. matching lean iff the 
lean resp. matching-lean kernel is F itself. Due to the polytime computability of 
the matching-lean kernel, which is a sub-clause-set obtained by removing clauses 
redundant in a strong sense, “w.l.o.g.” for the purpose of SAT-decision one might 
consider the inputs as matching-lean. 

Example 2.18 For inputs F £ MCEAN by Theorem f] we have SAT-decision 
in time 0(2 5 ( F " ) • n(F) 3 ) (see JdT) / for generalisations) , and thus SAT-decision for 
inputs F £ MCEAN is fixed-parameter tractability (fpt) in the parameter 5(F). 

We note here (though we won’t use it in this report), that for inputs F £ 
MCEANs—k the computation of the lean kernel can be done in polynomial time 
for fixed k (f&% Theorem 10.3]; this computational problem appears not to be fpt). 

A multi-clause-set F ^ T is matching lean iff for the surplus we have a(F) > 1 
(Jsl], Lemma 7.7j), which is defined as follows (see Subsection 11.1 in [|?| for more 
information; in |)3| a clause-set has “q-expansion” iff a(F) > q ): 



Definition 2.19 For a multi- clause- set F let cr (F) £ Z be defined as the minimum 
of 5(F[ V}) (recall Definition 2.L ) over all 0 ^ V C var(F) if n(F) > 0, while 
a(F) := 0 in case of n(F) = 0. 

Note that for 0 ^ V C var(F) we have 



5(E[y]) = c(F[V\) -\v\= F ( c ) - l y l- 

CeF, var(C)nV/0 



The surplus is computable in polynomial time. Some basic properties of the surplus 
are (for multi-clause-sets F): 

1. a(F) < 5(F[var(F)}) = 5(F \ {_!_}) < c(F). 
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2. For every 0 C V C vai '(F) holds a(F\V]) > <j(F). 

3. For F' < F with var (F') = var(F) we have a(F') < a(F). 

4. — n(F ) < cr(F), and for n(F) > 0 holds 1 — n(F) < a(F) < c(F) — 1. 

Example 2.20 er(A 0 ) = 0, while a(A n ) = 2" — n = S(A n ) for n £ N (since every 
variable occurs in every clause). If we take F £ CCS and some v £ VA \ var (F), 
then <r(-F U{{r>}}) < 0. 



3 Minimally unsatisfiable clause-sets 



In this section we review minimally unsatisfiable clause-sets; see ||13| for an overview, 
while |]58| , jS5) contain recent developments. First the basic definitions and examples 
are given in Subsection 3.1. In Subsection 3.2 we consider in some detail the funda- 
mental process of “saturation” , which is about adding “missing literal occurrences” 
to minimally unsatisfiable clause-sets. Saturation repairs the problem that splitting 
of F £ MU into (v — > 0} * F and (v — > 1} * F may destroy minimal unsatisfiable, 
i.e., (v — > 0) * F MU or (v — > 1) * F £ MU might hold, due to some clauses 
missed to be deleted, and this process is considered in Subsection 3.3. 



3.1 A iU and subclasses 

An unsatisfiable clause-set F is called minimally unsatisfiable, if for every clause 
C £ F the clause-set F \ {C} is satisfiable, and the set of minimally unsatisfiable 
clause-sets is denoted by Ai U C USAT ■ A clause-set F £ MU is called sat- 
urated, if replacing any C £ F by any super-clause C' D C yields a satisfiable 
clause-set, and the set of saturated minimally unsatisfiable clause-sets is denoted 
by SMU C MU. 



Example 3.1 The simplest element ofUSAT\MU is {_L, {!}} , while the simplest 
element of MU\SMU is {{1, 2}, { — 1}, {—2}} (see Example 3.1 for a “saturation”). 



Unsatisfiable hitting clause-sets fulfil UHTT C SMU (see |6^, Lemma 2] for the 
proof). The subsets of non-singular elements (i.e., there is no literal occurring only 
once) are denoted by M.U.' C MU, SAA.U' C SMU , and UHIF' C UH1T- 



Example 3.2 By [F\ j holds MU$ =1 = SMU' s=l = UTTIT$ =1 = {{_L}} , while for 
the characterisation of MUg=i D SMUg=i =U'HZTg=i see also |^, \Ti\ ( As shown 
in |Ij| j, for F £ MUg = \ with n(F) > 0 holds fj,vd(F) = 2. 

We consider the “reasons” for unsatisfiability as given by the elements of MUg=i 
as “noise”, only “masking” the pure contradiction of the only element of MU' s= i = 
{{_L}} (in Section || the elimination of singular variables will be discussed). “Real 
reasoning” starts with deficiency 2: 

Example 3.3 By the elements of MU$ =2 are U P isomorphism precisely the 
F n for n > 2, where 

F n := {{l,...,n},{— 1,..., — n}, {— 1, 2}, . . . , { — (n — 1), n}, {-n, 1}}. 

All F n are literal-regular, with fivd(F n ) = 4. It is easy to see that all F n are 
saturated, and thus MU$ =2 = SMU' s=2 - The only hitting clause- sets amongst the 
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T n are for n = 2, 3, and thus up to isomorphism the elements of WHITf- 2 are 
J-2 , J- "3 , with J - 2 = A 2 and 



F 3 = {{1, 2, 3}, {-1, -2, —3}, {-1, 2}, {-2, 3}, {-3, 1}}. 



We have a(T n ) = 2 = 5(T n ), since any m < n variables occur at least in m different 
binary claus es plus in t he two “ long clauses”. Further properties of the T n we have 
in Examples \ \6.t\, \ 8.“X 9.L See Section 1 in for more information. 



As shown in |g|, Theorem 74], for every F £ MU $ = 2 there is a unique n > 2 such 
that T n “embeds” into F, and this n is called the “non-singularity type” of F. 
So for MU $ = 2 we have identified the (in a sense) unique reason of unsatisfiability, 
the (possibly hidden) presence of a cycle v\ — >■ v n v\ together with the 

assertions, that one Vi must be true and one must be false (only the n is unique in 
general, not the vf). W e wil l come back to the theme of classifying MUs=k in the 
Conclusion, Subsection 15.5. 



By definition, MU disallows multiplicities (since a duplicated clause is the trivial 
logical redundancy), and this also holds for the subclasses SMU and UH.IT (as well 
as for all other subclasses of MU considered here). A fundamental fact is 8(F) > 1 
for all F £ MU (note that every minimally unsatisfiable clause-set is lean), which 
motivates the investigation of the layers MUs=i,MUg= 2 , Sp ecial elements of 



WHIT are the A(V) for finite sets V of variables (recall Lemma 2.13), which are 
the minimally unsatisfiable clause-sets with maximal deficiency for a given number 
of variables, as we will see in Corollary |B. 1 1 . 



3.2 Saturation 

We recall the fact (|^J, ^S2|) that every minimally unsatisfiable clause-set F £ MU 
can be saturated, i.e. , by adding literal occurrences to F we obtain F' £ SMU 
with var (F 1 ) = var (F) such that there is a bijection a : F — >• F' with C C a(C) for 
all C £ F. Since we need to consider saturation in many situations, we introduce 
some special notations for it from (65) (Subsection 2.2). First we introduce the 
notation S (F, C, x) for adding a literal £ to a clause C in a clause-set F: 

Definition 3.4 ( ||65f ) The operation ( adding literal x to clause C in F ) 

S (F, C, x ) := (F \ {C}) U (CU{z}) 6 CCS 

is defined if F £ CCS, C £ F, and x is a literal with var(a;) £ var(F) \ var(C). 

Some technical remarks: 

1. var(S(.F, C, x)) = var(F). 

2. If Ctl{i} ^ F, then c(S(F, C, x)) = c(F), and thus also 8(S(F,C,x)) = 8(F). 

3. For F £ MU we have: 

(a) S(F,C,x) £ MU iff S (F,C,x) is unsatisfiable (since all what happened 
is that a clause has been weakened, i.e., extended). 

(b) If S (F,C,x) £ MU, then c(S (F,C,x)) = c(F) (no subsumption here). 

(c) F is saturated iff there are no C, x such that S (F, C, x) £ US AT ■ 

Example 3.5 For F := {{a, 6}, {a}, {&}} £ MU \ SMU we have S (F,{a},b) = 
{{a, b}, {a, b},{b}} £SMU. 
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A “saturation” of a minimally unsatisfiable clause-set is obtained by adding 
literals to clauses as long as possible while maintaining unsatisfiability (which is the 
same as maintaining minimal unsatisfiability): 

Definition 3.6 (J65[) A saturation F' G SMU of F G MU is obtained by a 
saturation sequence F = Fq, , F m = F' , m G No, such that 

(i) for 0 < i < m there are Ci,Xi with Fi+i = S (Fi, Ci, xf), 

(ii) for all 1 < i < m we have Fi G US AT , 

(in) the sequence cannot be extended (without violating conditions (i) or (ii)). 

Note that n(F') = n(F) and c(F') = c(F) holds (and thus S(F') = 5(F)). If we 
drop requirement (Hi), then we speak of a partial saturation sequence, while 
F' G MU is a partial saturation of F G MU. 

Some technical remarks: 

1. F' G MU is a partial saturation of F G MU iff 

• var(F') = var(F) 

• there is a bijection a : F — >• F' such that for all C G F we have C C a(C). 

2. For a partial saturation sequence Fq, . . . , F m we have I(F m ) = I(F 0 ) + m. 

3. F' is a saturation of F G MU iff F' is a partial saturation of F with F' G 
SMU. 



Example 3.7 A saturation sequence for F := {{a, b, c}, {a}, {i>} , {c}} with m = 3 
is obtained by adding literals b,c to clause {a}, and adding literal c to clause {6}. 

We can perform a partial saturation F S(F,C,x) iff F without C implies 
(logically) Ctlji} (note that C U{a;}, C U{lc} implies C): 



Lemma 3.8 Consider F G MU, C G F, and a literal x with var(ir) G var (F) \ 
var (C). ThenS(F,C,x) is a partial saturation of F if and only if F\{C} 

Proof: First assume that S (F,C,x) is a partial saturation of F, but F \ {C} 

C U{x}. So there is a partial assignment p with ip*(F\{C}) = T but p*{C hJ{x}} = 
{_L} (whence ip(x) = 1). But then we have tp * S (F,C,x) = T. Reversely assume 
^\{C} N C but that S(F, C, x) is not a partial saturation of F. So S(F, C , x) 
has a satisfying assignment due to F G US AT we have <p(x) = 1 and <p * {C} = 
{_!_}. But this yields ip * (F \ {C}) = T and ip * {Cb{5}} = {_L}. □ 



See Lemma 3.5, Part ^|, for another characterisation of partial saturations. The 
dual notion of “saturated” is “marginal”: F G MU is marginal iff replacing any 
clause by a strict subclause yields a clause-set not in MU. The decision “F marginal 
minimally unsatisfiable ?” for inputs F G CCS is D p -complete ([|f| Theorem 2]). 
By Q, Theorem 8] however this decision is easy for inputs F G SMU , namely 
there is the following characterisation of minimally unsatisfiable clause-sets which 
are marginal and saturated at the same time: 



Lemma 3.9 (p4[) F G MU is marginal and saturated iff F = A(var(F)). 
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Thus F £ SMU is marginal iff F = A(var(.F)); so if F £ SMU is not full, 
then there is a literal occurrence which can be removed without destroying minimal 
unsatisfiability, that is, there exists a clause C £ F and x £ C such that F' := 
( F \ {C}) \ {a;}} £ MU (note that F' £ US AT in any case, but minimality in 

general is not maintained). And for inputs F £ SMU the existence of such C, x is 
decidable in linear time (namely they exist iff F is not full). But finding such C, x 
should be hard in general, that is, the decision problem, whether a concrete literal 
can be removed, even for inputs F £ SMU should be NP-complete: 

Question 3.10 Is the promise problem for input F £ SMU , C £ F, x £ C, 
whether “(F \ {C}) U{C \ {a;}} £ MU ?”, NP-complete? (That is, is there a 
polytime computation G £ CCS (F, C,x) £ SMU x CC x CTT , with x £ C £ F , 
such that G £ SAT ^{F\ {C}) U{C \ {a:}} 6 MU ?) 

And is the promise problem for input F £ MU , whether F is marginal, NP- 
complete? (That is, is there a polytime computation G £ CCS F £ MU, such 
that G £ SAT iff F is marginal?) 

Back to saturation: precisely all saturated clause-sets except the A(V) are ob- 
tained as non-trivial saturations of some minimally unsatisfiable clause-set: 



Corollary 3.11 Consider F £ SMU. 

1. F is trivially the saturation of itself. 

2. If F = A(vax(F)), then this is also the only possibility for F being a saturation, 
that is, if F is the saturation of some F' £ MU, then we have F' = F. 

3. Otherwise F is a saturation of some clause-set other than it itself, that is, if 
F ^ A(var(P)) ; then there is some F' £ MU with F' ^ F such that F is a 
saturation of F' . 

Proof: Part |l] is trivial. For Part assume that F = A(var(.F)), and we have 
F = S (F',C,x) for some F' £ MU: But since F is marginal, F' is not minim ally 
unsatisfiable. Finally for Part || note, that if F ^ A(var(F)) : then by Lemma [l.9| 
F is not marginal, and thus there is C £ F and x £ C such that for C' := C\ {x} 
and F' := (F \ {C}) U{C"} we have F' £ MU. Now F = S {F' , C' , x). □ 



As discussed above, we expect the decision whether for inputs F £ SMU , 
C £ F and x £ C we have (F \ {C})U{C"} £ MU to be NP-complete. But 
this decision is easy for F £ WHIT , namely iff no “subsumption resolution” with 
another clause containing x can be performed, that is, there is no D £ F with x £ D 
and C \ {a;} C D: 



Lemma 3.12 Consider F £ UTIT , C £ F and x £ C. Let C' := C\ {a;}, and let 
F' := (F \ {C}) U{C"}. Then we have F' £ MU iff there is no D £ F \ {C} with 
C' C D. 

Proof: If there is D £ F \ {C} with C' C D, then F' £ MU. So assume there is 
no such D. Assume F' £ MU. Thus there is E £ F' with F' \ {E} unsatisfiable. 
We must have E T C' , since otherwise F \ {C} would be unsatisfiable. Since F 
is hitting, E clashes with every clause of F' \ |C"}. It follows that C C E must 
hold (since the falsifying assignments for E are disjoint with those for any clause in 
F' \ {C"}), contradicting the minimal unsatisfiability of F. □ 



Some examples on removable literal occurrences illustrate Lemma 3.12 
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Example 3.13 For F := {{a, b}, {a, b}, {5}} G WHITs=\ we can exactly remove 
one of the two literal- occurrences ofb and still obtain a clause-set in MU (of course 
not in WHIT anymore; the resulting clause-sets are in fact marginally minimally 
unsatisfiable) . For T 2 = A 2 = {{ 1 , 2}, { — 1, —2}, { — 1 , 2}, {—2, 1 }} G WHIT, ^_ 2 we 
can not remove any literal occurrence without leaving MU (i.e., T 2 is marginal) . 



3.3 Splitting 

An important (equivalent) characterisation of saturation for F G CCS, as shown in 
|58| , is that splitting on any variable v will yield minimally unsatisfiable clause-sets 
(v — > 0) * F, (v — > 1) * F. This enables induction on the number of variables, 
which is a central method for this report; see Lemma 8.1 for the basic example. We 
also have that if for some variable both splitting results are minimally unsatisfiable 
resp. saturated, then this can be lifted to the original clause-set, provided that no 
contraction takes place. These basic facts are collected in the following lemma. 



Lemma 3.14 For all clause-sets F G CCS we have: 



1. F G SMU iff for all v G var(F) and all e G {0, 1} we have ( v — t e)*F G MU. 

2. If for some variable v holds (v -4 0) * F G MU and (v -4 1) * F G MU, and 

if for all C G F with v G var(C) we have C \ {u,u} ^ F, then F G MU. 

3. If for some variable v holds (v — > 0) * F G SMU and (v -4 1) * F G SMU, 

and if for all C G F with v G var(C) we have C\{u,u} ^ F, then F G SMU. 

Proof: Part |l| is Corollary 5.3 in j5^. For Part |2| assume F £ MU; thus there is 
C G F with F' := F\ {C} G US AT. We consider three cases: 

1 . v var(C): Due to the assumption on subsumption-freeness we have C U{u} ^ 
F 1 . Now C G (v -4 0) * F, while ({v -4 0 )*F)\ {C} = («-t0)*f'G US AT, 
contradicting (v — > 0) = 1 = F G MU. 

2. v G C: By assumption holds C' := C\{u} ^ F' . Now C' G (v — > 0) *F, while 
({v -4 0)*F)\{C"} = (v -4 0 )*F' G USAT , contradicting (v -4- 0 )*F G MU. 



3. v G C: By assumption holds C' := C\{u} ^ F’ . Now C' G {v — > 1) *F, while 
({v -4 1)*F)\{C"} = (v -4 1)*F' G USAT, contradicting (v -4 1 )*F G MU. 

Finally consider Part ||. By Part || we already know that F G MU holds. Assume 
that F SMU; thus there is C G F and a literal x with F' := S(F, C,x) G USAT. 
So by Lemma T8 we have F \ {C} |= C := CU{f}. There exists at least one 
e G {0, 1} with (v — > e) * {C'} T T, and then (v — > e) * (F \ {C}) \= (v — > e) * C' . 
If var(a;) = v, then this contradicts minimal unsatisfiability of (v — > e) * F. And if 
var(a;) T v i then (v — > e) * F \ (v — > e) =t= {C} \= ((v — > e) * C) dJ{a;}, contradicting 
saturatedness of (v — > e) * F by Lemma 3.8. □ 



The additional assumption C \ {v,u} ^ F for Parts ^ is equivalent to saying 
that when applying (v 4 0), (d 4 1), then no contraction takes place. An alterna- 
tive way of stating this would be to use multi-clause-sets, since then no contractions 
would be performed, and the doubled clauses would destroy minimal unsatisfiability. 
In |62|] (Lemma 1) (and in the underlying report ]63|], Lemma 2.1) that additional 
assumption for Parts |2j. ^ is missing by mistake: 



Example 3.15 An example for (v — > 0) * F G UH.IT and (v — > 1) = 1 = F G WHIT, 
but F MU is trivially given by {_L,{u},{u}} (note that F is a set — for a 
multi- clause- set F the contraction would not occur). 
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4 Variable-minimal unsatisfiability 

In |ll| the generalisation of minimal unsatisfiability to “variable-minimal unsatisfia- 
bility” has been introduced, and the class of all such clause-sets is denoted by VMU , 
the set of clause-sets F £ US AT such that for every F' C F with F' £ US AT 
holds var(F') = var(F). The corresponding class VMU of multi-clause-sets is 
invariant under multiplicities. Thus, as with CSAAf (and different from MU ), re- 
garding variable-minimal unsatisfiability w.l.o.g. multi-clause-sets can be cast down 
to clause-sets. The basic (trivial) characterisation of VMU is: 



Lemma 4.1 For F £ CCS holds F £ VMU if and only if F £ US AT and for all 
v £ var(F) holds {C £ F : v £ var(C)} £ SAT ■ 

By definition we have MU C VMU, moreover, as shown in Lemma 6 of fll| , 
for every deficiency k > 2 we have MUs=k C VMUs=k (for example, for every 
F £ MU$=k, k £ N, and every non- full clause C £ F, i.e., var(C) C var(F), we can 
add to F a full clause subsumed by C, obtaining F' £ VMUs=k+\ \ MUg=k+i)- 

In 0 there is the false statement “ VMU % CSAAf" , based on following erro- 
neous example: 

Example 4.2 / | 77 | . Page 266] gives the example F 4 := {{a}, {b}, {a, b}, {a, b}} with 
the assertion “ F . 4 £ VMU \ CSAAf " '. Obviously we have F 4 £ VMU, but we 
also have F 4 £ CSAAf . Using the characterisation from J51 \] (which is the only 
characterisation used in that F £ CSAAf holds iff every clause of F can be 

used in a tree-resolution refutation of F, we see this as follows: the sole subset of 
F 4 in MU is {{a}, {6}, {a, 6}}, while the clause {a,b} can(!) also be used in a 
tree-resolution refutation — it is obviously superfluous, but nevertheless there is a 
tree-resolution refutation using it, namely via ({a} o{a, &}) o{a, 6} = {a}. 

Based on the characterisation of lean clause-sets via autarkies, it is easy to show 
that VMU consists of special lean clause-sets (thus Figure 1 in |lj| needs to be 
corrected, showing instead that CSAAf is indeed a superclass of VMU): 



Lemma 4.3 VMU C CSAAf \ {T} . 



Proof: While in jTT| the characterisation of CSAAf via variables usable in resolu- 
tion refutation was (only) used, here we need to use the equivalent characterisation 
via autarkies, shown in Theorem 3.16 in ]5(|, and used as our definition in Sub- 
section 2T, namely that for F £ CCS holds F £ CSAAf iff there is no autarky p 
for F with var(tp) n var(F) 7^ 0: if we had such an autarky for F £ VMU, then 
ip * F £ US AT with ip * F C F and var(<p * F) C var (F) \ var(F), contradicting 
F £ VMU. That we indeed have a strict subset can for example be seen by Lemma 
3.2 in |k|, which shows that if we extended a minimally unsatisfiable clause-sets via 
Extended Resolution, then we always stay in CSAAf', another example is clause-set 
F3 on Page 266 in 0. □ 



Thus it follows VMUs=i = MUs=i (shown in Lemma 6 in 0), since by Corol- 
lary 5.7 in |5(J holds CSAAf$=i C\USAT = MUs=i- 

For F £ VMU obviously there is some F' C F with var(F') = var(F) and 
F' £ MU\ Lemma 5 in 0] asserts the converse, but this is false, as the following 
simple example shows: 



Example 4.4 Consider F := {_L, {r;}, {u}} and F' := {{u}, {u}}; we have F' £ 
MU and var(F') = var(F), but F VMU, since {_L} £ US AT . If we don’t want 
to use the empty clause, then we can consider any F' £ MU with v £ var(F') and 
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{{zj} , {«}} n F' = 0, and let F := F' U{{z>}, {t<}} — again we have F' £ MU and 
var(F') = var(F), but F £ VMU. 

The corrected version of Lemma 5 from [|lT] is as follows: 

Lemma 4.5 For F £ CCS let Uf be the set of F' C F with var (F r ) = vax(F), 
S(F') > 1, and F 1 £ US AT- Then F £ VMU if and only if F £ Uf and all 
minimal elements o/Uf w.r.t. the subset-relation are minimally unsatisfiable. 

Proof: The condition is necessary, since if F £ VMU, then on the one hand we 
have F £ C£AAf\ {T}, and thus 6(F) > 1 by |w| (or use Lemma 3 in |[TT|l ) ; and 
on the other hand if there would be a minimal element F' £ HJf which wouldn’t 
be minimally unsatisfiable, then there would be some F" C F' with F" £ MU, 
whence by definition of Uf we get var(E") C var(E) contradicting F £ VMU. 

For the other direction assume, that we have Uf as specified, and we have to 
show F £ VMU. Since F £ Uf, we have F £ US AT. Consider now some F 1 C F 
with F' £ US AT, and assume var(F') C var(F). Consider some minimal F" £ Uf 
( regarding inclusion) with F' C F" C F . Furthermore consider a minimal element 
G £ Uf with G C F"; by assumption G £ MU, and since F' C F" , we have 
G C F" . If for C £ F" we had var(E" \ {C}) C var(F"), then there would be 
x £ C such that x or x is pure in F" , thus also pure in G, whence C G (since 
G £ MU), contradicting var(G) = var(F). Now choose some C £ F" \ F' (we 
have var (F" \ {C}) = var(F")); by minimality of F" we now have 8(F" \ {G}) < 0 
(otherwise all conditions for Uf are fulfilled for F" \ {G}), whence 8(F") = 1. 
But then due to var (F") = var (G) and G C F" it follows 8(G) < 0, contradicting 
G £ MU. □ 



The following examples show applications of Lemma 4.5 



Example 4.6 Consider the two (non-) examples from Example 

1. For F = {_L, {?;}, {z7}} we have the minimal element {T, {u}} of Uf which is 
not minimally unsatisfiable. 

2. For F = F' ld{{?;}, {zJ} } (note that from the assumptions follows var(F') D 
{njj consider a minimal F" C F' with var(G) = var(E) and 8(G) > 1 for 
G := F"U{{z},{n}} (note T C F" C F): now G is a minimal element of 
Uf which is not minimally unsatisfiable. 



Based on Lemma 5 in jnj, also the proof of Theorem 3 in [^l) is false (the 
procedure goes astray on the clause-sets of Example 4.4). Fortunately we can give 
a simple proof of the assertion, which even shows fixed-parameter tractability (fpt) 
of the decision problem “F £ VMUs=k ?” in the parameter k: 



Theorem 4.7 Membership “F £ VMUg-k ?” for input F £ CCS is fpt in the 
parameter k £ Z. 



Proof: If F £ MCSAAf, then F VMU (by Lemma 4.3). So we can assume now 
F £ MCSAAf, and thus we have S(F') < 8(F) for all F' C F. Now the decisions 
of Lemma 4.1, as discussed in Example |2.18, are fpt in k. □ 
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5 Eliminating and creating singularity 



In this section we continue the study of the handling of singular variables in min- 
imal unsatisfiable clause-sets 
reduction process 



as initiated in |6j, |55|. In Section 5.1 we study the 
eliminating singular variables. A main insight is Lemma 5.4, 



showing that the elimination is harmless concerning the minimum variable degree. 
In Subsection 5.2 we introduce the inverse elimination (“extension”); the mai n po int 
here is the precise statement of the various conditions. Finally in Subsection 5.S we 
consider a special case of singularity, namely unit-clauses. 



5.1 Singular DP-reduction 

In Q (Section 3) the process of “singular DP-reduction” has been studied for 
minimally unsatisfiable clause-sets. By it we can reduce the case of arbitrary 
F £ MU to (non-singular) F' £ MU' (that is, for every v £ var(F') we have 
ldF'(w), ldF'(^) > 2). The definition is as follows (see Definition 8 in |65||): 

Definition 5.1 (|]65[) The relation F ‘ S - P - P > F' (singular DP-reduction) holds for 
clause-sets F,F' £ CCS, if there is a singular variable v in F, such that F’ is ob- 
tained from F by DP-reduction on v, that is, F' = DP„(.F). The reflexive-transitive 

sDP 

closure of this relation is denoted by F - — >■* F' . 

oTA P 

By sDP(F) C MU for F £ MU the set of non-singular F' £ MU with F - — >* F' 
is denoted. For us the main property of sDP(F) is that it is not empty. In |65) it is 
shown that for S £ SMU we have |sDP(F)| = 1, and that for arbitrary F £ MU 
and F', F" £ sDP (F) we have n(F') = n(F"). 

Example 5.2 In the following is shown for F £ MU: 

1. For 6(F) = 1 we have sDP(F) = {_L}. 

2. For 5(F) = 2 all elements of sDP(E) are isomorphic. 

3. For 5(F) > 3 in general there are non-isomorphic elements in sDP(F’). 

By the results of Sections 3.1, 3.2 in ||| we have the following basic preservation 
properties: 

Lemma 5.3 (|]65[) For F,F' £ MU with F sDP > * F' we have: 

1. 5(F') = 5(F). 

2. F £ MU => F' £ MU. 

3. F £ SMU => F' £ SMU. 

4. f £ uuit => f' £ umr. 

Although singular DP-reduction can reduce the variable-degree of some vari- 
ables, it can not decrease the minimum variable-degree: 

oJ~)P 

Lemma 5.4 For F, F' £ MU with F >■* F' we have /rvd(F') > p,vd(F). 
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Proof: It is sufficient to consider the case F' = DP„(P) for a singular variable v. 
Assume pvd(F') < /rvd(F); thus var(F') ^ 0 (otherwise we have pvd(F') = +oo), 
and we consider w £ var (F 1 ) with vd f'{w) = /rvd {F'). So we have vdF'(w) < 
vdF(’u ; )i and thus by Lemma 24 in |^5|, for all clauses C £ F with v £ var (G) we 
have w £ var(G). But then gvd(F') = vd f'{w) > vdF(u) > /rvd(F) > /rvd(F'), a 
contradiction. □ 



Thus in order to determine the minimum variable-degree for minimally unsatis- 
fiable clause-sets in dependency on the deficiency, w.l.o.g. one can restrict attention 
to saturated and non-singular instances: 



Corollary 5.5 For all k £ N holds: 



1. nvd(MUs=k) = gvd(SMU' s=k ). 

2. ^d(uniTs= k ) = ^dipmiTLk)- 



Proof: For Part |]we note that by Lemma A4 for every F £ MUg=k we can find 
F' £ SMU) =k with /xvd(F') > gyd(F), and thus gvd(MUs=k) < lxvd(SAAUg =k ), 
while fivd(MUs^k) > /rvdbSA 4Ug =k ) holds due to SAAU' s= k c MU$ = k ■ The same 
reasoning applies for Part 0. □ 



See Lemma 15.9 for some conditions under which fceN 
k £ N i — >■ n\d(MUs=k) would be computable. 



gvd(UHTT5=k) and 



5.2 Singular DP-extensions 

We consider now the reverse direction of singular DP-reduction, from DP„(F) to F, 
as a singular extension , and also generalise it to arbitrary clause-sets. This process 
was mentioned in |i5|, Examples 15,19,54] for minimally unsatisfiable DP„(F), called 
“inverse singular DP-reduction” there: 

Definition 5.6 Consider a clause-set G £ CCS , a variable v £ "H4\var(G), and 
m £ N. A singular m-extension of G with v is a clause-set F £ CCS obtained 
as follows (employing four choice steps): 

1. to different clauses D i, . . . , D m £ G are chosen. 

2. A subset C C D[ is chosen. 

3. A literal x with var(x) = v is chosen. 

4- Clauses D[ £ CC for i £ {1, . . . , m} with ( Di \ C) C D[ C D, are chosen. 

5. Let C' := C U{:r}, and let D" := D\ U{x} for i £ {1, . . . , to}. 

6. F is obtained from G by adding C' and replacing the Di with D " : 

F ■= (G\ {Di , . . . , D m }) u{ C', D ", . . . , D" }. 

Example 5.7 Consider G := {{a, 6, c}, {a, 6, c}}, m := 2, and the choices C := 
{a}, x := v, and D[ := {6,c}, D' 2 := {a,6,c}. Then the 2-extension F of G is 
F = {{u,a},{F,6,c},{F,a,6,c}}. 

By definition we have for an w-extension F of G £ CCS with v the following 
simple properties: 

1. c(F) = c(G) + 1, n(F) = n(G) + 1, 6(F) = 6(G). 
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2. v is singular for F, vc1f(u) =m+l. 

3. DI\.(F) = G. 



We now show that indeed the process of Definition 5.6 is precisely the inversion 
of singular DP-reduction: 



Lemma 5.8 Consider m £ N, G, F £ CCS and v £ VA. Then F is an m-extension 
of G by v iff the following conditions are fulfilled: 

1. v is singular for F ; 

2. vdp{v) = to + 1; 

3. DP„(F) = G, c(G) = c(F) - 1. 

Proof: If F is an m-extension of G by v, then the three properties hold, as we have 
already mentioned. So assume these three properties hold. Now let the to clauses 
D i , . . . , D m be the result of singular DP-reduction on v for F; they must be pairwise 
different, and all to resolutions must be possible, otherwise c(G) < c(F ) — 1. And 
let C be singular occurrence of v minus the variable v. Now all properties of a 
singular m-extension are easily checked. □ 



Singular extensions behave well regarding minimal unsatisfiability: 



Lemma 5.9 Consider m £ N, G £ CCS and an m-extension F of G by v £ VA. 
Then F £ MU &G £ MU. 

Proof: This follows by Lemma together with Lemma 9, Parts 1, 2 in |35). □ 

In the situation of Lemma | r ).9[ . regarding saturatedness we only have the direc- 
tion F £ SMU => G £ SMU, while for the other direction the conditions of p5| , 
Lemma 12] need to be observed (this would yield “saturated extensions”, which 
however we do not need here). 



5.3 Unit clauses 

We conclude this section by considering unit-clauses in minimally unsatisfiable 
clause-sets. The following (fundamental, simple) lemma is proven in | |65{ (Lemma 
14); there in Subsection 3.3 one finds further information on unit-clauses in mini- 
mally unsatisfiable clause-sets. 



Lemma 5.10 ([65 ) 



Consider F £ MU. 



1. If v is full and singular in F , then we have {v} £ F or {u} £ F. 



2. If {x} £ F, then v := var(x) is singular in F (with kipO^) = 1). If here F is 
saturated, then v is also full in F . 



So unit-clauses in minimally unsatisfiable clause-sets are strong cases of singular 
variables. They can obviously be removed by singular DP-reduction, while singular 
> 2-extensions can not remove all unit-clauses: 



Lemma 5.11 Consider a clause-set F £ MU containing at least one unit- clause, 
and obtain F' from F by a singular m-extension, where to > 2. Then also F' must 
contain at least one unit-clause. 
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Proof: For a unit-clause {cc} £ F to be removed in F', it needs to be one of the Di 
(using the terminology of Definition |5.C| ) . Then the intersection C must be empty 
(otherwise any other Di needed to contain x, and since m > 2 this would mean a 
subsumption in F). Thus the extension introduces the new unit-clause C" . □ 



The following examples shows that the assumptions F £ AilA and m > 2 in 
Lemma 5.11 are needed: 



Example 5.12 First consider F := {{a}, {a, b}, {a, 6} } £ AAU. Via a 1-singular 
extension we obtain F' := {{u, a}, (v, a}, {a, &}, {a, &}}, which has no unit-clauses. 
A 2-singular extension of F, which touches {a}, has C = _L, and thus C' is a new 
unit-clause. If on the other hand we consider F := {{a}, {a, b}} £ CCS\M.U, then 
F' := {{u, a}, {U, a}, (U, 6}} is a 2-extension without unit-clauses. 

For certain F £ MUs ^2 the existence of a unit-clause is actually necessary for 
singularity: 



Lemma 5.13 Consider F £ MlA ^—2 with gvd (F) > 4. Then F is singular if and 
only if F contains a unit-clause. 



Proof: That if F contains a unit-clause, then F must be singular, follows by 

Lemma 5. 1C|, Part p|. So assume now that F is singular, and we have to show that 

s£)p sDP 

F contains a unit-clause. Consider a reduction sequence F = F 0 > F\ > 

sDP 

. . . > F m , where F m is non-singular (note to > 1). So there exists n > 2 such 



that F m is isomorphic to T n (r ecall Example |3.3| ), and thus every variable of F m 
has degree 4. So by Lemma |5T| we know /xvd(-Fj) = 4 for i £ {0, . . . , to}. We show 
by induction on to that F contains a unit-clause. If to = 1, then in order to obtain 
the min-var-degree of at least 4, at least 3 side-clauses D\, ... , D 3 £ T n for the 
singular extension have to be chosen (using Definition 5.6), but every literal occurs 
precisely twice in T n (because of variable-degree 4 and non-singularity), and thus 
the intersection C has to be empty, and the new clause introduced by the singular 
extension is a unit-clause, whence F contains a unit-clause. Finally assume to > 1. 
So by induction hypothesis, F\ contains a unit-clause, and thus by Lemma [5.11 also 
Fq contains a unit-clause. □ 



We will later see (Theorem fO) that the condition fxvd(F) > 4 in Lemma 5.13 



is equivalent to gvd(F) = 4; the following examples show that this condition can 
not be improved: 



Example 5.14 A 1-singular extension of A 2 is 

Fi := {{1, 2, 3}, {1, 2, -3}, (-1, 2}, {1, -2}, {-1, -2}} £ MU S=2 , 

where F\ is singular, has no unit-clause, and /ivd(i 7 'i) = 2. While a 2-singular 
extension of A 2 is 

F 2 := {{1, 3}, {1, 2, -3}, {1, -2, -3}, {-1, 2}, {-1, -2}} £ MU S =2 , 

where F 2 is singular, has no unit-clause, and ^vd(F 2 ) = 3. 

We conclude with a simple form of adding a new variable, by adding it in one 
sign as unit-clause, and adding it in the other sign to all given clauses: 

Definition 5.15 A full singular unit- extension of a clause-set F £ CCS (by 

unit-clause { 2 ;} ) is a clause-set F' £ CCS obtained from F by adding a unit-clause 
{to} with var(x) ef var(F'), and by adding literal x to all clauses of F, i.e., F' := 
{{i}}tJ{Ctl{f} : C £ F} for some x £ CTT\dt(F). 
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A full singular unit-extension F' of F by {a;} is a case of a singular c(F)-extension 
of F with var(a;), and thus F' sDP > F.p^| 

Example 5.16 Starting from {_L}, the (up to the choice of the new literal) first full 
singular unit- extension is {{u}, {v}}, the second one is {{w}, {v,?U}, {u, w}}. In this 
way we get special examples of SMUs-i (since we started with {_L} £ SMU$ = \). 

If we start with T instead, then first we get {{u}}, and then {{u>}, {v, w}}. 

Example 15, Part 1, in contains two example of “inverse unit elimination”, 
where Example (a) there is an example of a full singular unit- extension, while Ex- 
ample (b) there would be a non- full singular unit- extension (where the new variable 
is not full; not used in this report). There is the dual notion of a “full variable” 
for a clause-set F, which is some element of flce-F var(C) ; which explains why we 
speak of a “full extension” (namely the new variable is full). 

The process of full singular unit-extension of a clause-set F maintains many 
properties of F, and we list here those we use: 

Lemma 5.17 Consider a full singular unit- extension F' of F (by {v}): 

1. n(F') = n(F) + 1 and c(F') = c(F) + 1. 

2. S(F') = 6(F). 

3. a(F') = a(F ) forF^{±}. 

4- ^ivd(F') = /xvd(-F) for n(F ) > 0. 

5. F' is satisfiable iff F is satisfiable. 

6. For F/T: F' is lean iff F is lean. 

7. F' is (saturated) minimally unsatisfiable iff F is (saturated) minimally unsat- 
isfiable. 

8. F' is hitting iff F is hitting. 

Proof: Parts [j], || follow directly by definition. For Part |J we notice that for F = T 
we have cr(F') = a(F) = 0, while for n(F) > 0 consider 0 C V C var(F'): if v ^ V , 
then = F[P], and thus the minimisation for a(F) is included in a(F'), and 

if v £ V, then (5(F'[F]) = c(F') — |F| > S(F') = 6(F) > a(F), and thus these V do 
not contribute to the minimisation. 

For Part|4]we just note that the variables of F keep their degrees in F' , while the 
new variable has degree vd^/(u) = c(F') > c(F), and thus does not contribute to 
the min-var degree. Part || is trivial, and follows also by the satisfiability-equivalence 
of DP„(F) and F. For Part |(i| we note, that an autarky for F' involving v must 
be a satisfying assignment for F', while the autarkies for F' not involving v are 
the same as the autarkies for F. Part |?] concerning (just) minimal unsatisfiability 
follows with Lemma 9 in ^5|, while regarding saturatedness we can use Lemma 12 
in {55| (both assertions also follow easily by direct reasoning). Part |§| is trivial. □ 

So our fundamental classes are respected by full singular unit-extension: 



Corollary 5.18 If F £ MUs-k (k £N), then every full singular unit-extension is 
also in MlAs=k- If furthermore F is saturated resp. hitting, then every full singular 
unit- extension is also saturated resp. hitting. 



12) The case m = 0 is excluded in Definition 
the formulation. 



5.6, since it is not needed, and would only complicate 
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Obviously, full singular unit-extension is unique up to isomorphism: 

Lemma 5.19 Consider a clause-set F £ CCS and clause-sets F',F” £ CCS ob- 
tained from, F by repeated full singular unit- extensions. Then F' , F" are isomorphic 
if and only if n(F') = n(F"). 

Proof: The number of repeated full singular unit-extensions leading to F' resp. 
F" is the number of variables in these clause-sets with degree strictly greater than 
c(F), and sorting these variables by increasing degree yields the sequence of exten- 
sions. Thus just from knowing the number of variables in F' , F" we can reconstruct 
them up to isomorphism (using that a full singular unit-extension of F by {a;} is 
isomorphic to one by {y}, for arbitrary literals x,y with new variables). □ 



6 Full subsumption resolution / extension 



In this section we investigate the second reduction concept for this report, “full sub- 
sumption resolution” . As with singular DP-reduction from Section |s|, in general the 
reduction uncovers hidden structure, while the inverse process, “full subsumption 
extension” , serves as a generator for minimally unsatisfiable clause-sets with various 
properties. However in this report, unlike with singular DP-reduction, we will not 
consider full subsumption resolution for arbitrary F £ MU, but only starting from 
some A(V), while a deeper use will be important in Q. Subsection |6.l| discusses 
the basic definitions (there are various technicalities one needs to be aware of), and 



first applications are given in Subsection 3.2 



The basic idea is, for a clause-set F containing two clauses R U{u}, f?U{u} £ F, 
to replace these two clauses by the clause R, i.e. , we consider the case where the 
resolvent R of parent clauses C, D subsumes both parent clauses (thus the name). 
This is a very old procedure, based on the trivial observation that (R V v) A (f? V -<v) 
is logically equivalent to R. If we perform this in the inverse direction, as an “ex- 
tension” , then every clause-set F £ CCS can be transformed into its “distinguished 
CNF” F' C A(var(F)) (just expand every non-full clause), which is uniquely deter- 
mined. We however have to be more careful about deficiency and membership in 
MU, and thus will consider only “full subsumption resolution”, where the resolvent 
must not be present already, while for the “strict” form additionally the resolution 
variable v must occur also in other clauses. Then from A(V) by strict full subsump- 
tion resolution we can obtain precisely the F £ WHIT with var(F) = V (Lemma 
3.£). For the inverse forms, we have to be even more carefully, making sure that 
neither any of the two parent clauses is already present (this prevents the above 
expansion of arbitrary F £ CCS to A(var(F))). 

The (more general) well-known “subsumption resolution” is the reduction F ^ 
(F \ {C}) U {C \ {a;}} for F £ CCS, that is the removal of a literal x £ C from a 
clause C £ F, in case there exists D £ F with x £ D and D \ {x} C C (note that 
CoD = C\{:r} subsumes C). An early use is in Q (under the name “replacement 
principle”), while the terminology “subsumption resolution” is used in p9[ (for 
SAT solving). The earliest sources with a systematic treatment appear to be p9| , 
Section 7] and [ |60| Section 7]. An experimental study of the practical importance 
of subsumption resolution in connection with DP-reductions F DP,, (_F) (under 
suitable additional conditions to make DP-reduction feasible; see Subsection 1.3 in 
H for an overview on such restrictions) is performed in [ po[ (under the name of 
“self-subsuming resolution”), continued in [^7|. A theoretic (similar) use one finds 
in El Section 4], where a variable v is called “DP-simplicial” for F £ CCS iff all 
resolutions performed by the reduction F DP t ,(F 1 ) are subsumption resolutions. 
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Our special form, where both parent clauses are subsumed we call full sub- 
sumption resolution , namely the reduction F (F \ {C, D}) t){CoD} in case of 
C,D £ F such that C D D = { 2 } and C \ { 2 } = D \ { 2 }. A main tool is Lemma 
|3.5| , where especially Part is somewhat subtle, and can be easily overlooked. Via 
this tool we have a controlled way of transforming F £ MU resp. F £ WHIT 
into A(var(F)), and in Theorem S.13 this yields the determination of the possible 
numbers of variables and clauses in minimally unsatisfiable clause-sets of a given 
deficiency. 



6.1 Basic definitions 

Before defining “full subsumption reduction” F (F \ {t?.U{u}, AU{u}}) U{i?} 
in Definition 5.7 (so R is new and the two clauses i?ld{u} vanish), we 

introduce the “strict” form, which is more important to us, and which has the 
additional condition that v must still occur (in other clauses of F ; the “non-strict” 
form on the other hand guarantees that v vanishes (see Definition 6.7)): 



Definition 6.1 For clause-sets F,F' £ CIS by F - - R > F' we denote that F' is 
obtained from F by one step of strict full subsumption resolution , that is, 

• there is a clause R £ F' 



• and a literal 2 with var( 2 ) ^ R 

• such that for the clauses C := i?U{ 2 } and D := i?U{ 2 } 

• we have F = ( F' \ {A}) U{C, D}; 

• we furthermore require var( 2 ) £ var (F 1 ). 

• As usual, the literals 2,2 are the resolution literals, var( 2 ) is the resolution 
variable, C, D are the parent clauses, and R is the resolvent. 

We write F s - s —>k F' for k £ No if exactly k steps have been performed, while we 
write F s ^ sR >^ F' for an arbitrary number of steps (including zero). 

We require R ^ F, that is, the (full subsumption) resolvent is not already present 
in the original clause-set. This is of course satisfied if F £ MU. We also require 
that the variable v does not vanish, for the sake of keeping control on the deficiency. 



Example 6.2 Some simple examples are: 

1. T 2 = {{1, 2}, (-1, -2}, {-1, 2}, {-2, 1}} ^ {{2}, {-1, -2}, {-2, 1}}, and 
no further reduction is possible ( note that the only possibility is blocked, since 
variable 1 would vanish). 

2. {{ 2 }, { 2 }} f sR > {-L}, as v vanishes, ii)He {{«}, {«}, {«, 2 }} {1, {«,i}}. 

&J~sR 

3. {{u, w}, { 2 , w}, { 2 , W}} / > {{ 2 , w}, {{rc}, { 2 , uJ}}}, as one parent clause is 

kept, while {{2,2:}, {2,^}, {2,72}} s - sR > {{{w}, {2,2;}}}. 

4- {{2, 2;}, {2, 21}, {2, W}, {2}, {22} } can not be reduced by strict-full-subsumption 
resolution, since all possible resolvents are already there. 

The expansion of a clause R to two clauses i?U{ 2 }, i?hl{ 2 } under the above 
requirements is called “extension” : 
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Definition 6.3 For clause-sets F,F' £ CCS we say that F is obtained from F' 

by strict full subsumption extension if F s - s -> F' . And for k £ No we say that 
F is obtained from F' by strict full subsumption extension with k steps 

So one step of strict full subsumption extension for a clause-set F uses a non-full 
clause R £ F and a variable v £ var (F) \var(i?), and replaces R by the two clauses 
i?U{v}, f?U{u}, where none of them is already present. 



Example 6.4 From {{a,}, {£<}} by one step of strict full subsumption extension we 
can obtain {{a, 6}, {a, b}, {6}} or {{a}, {a, b}, {a, 6}}; note that no new variable has 
been introduced, that the original clause ({a} resp. {b}) vanished, and that its re- 
placement clauses were not already present. For {{a, 6}, {a}} no strict full subsump- 
tion extension is possible. Further examples are obtained by “reading Example \ 6.\ 
backwards”. 

The basic properties of strict full subsumption resolution are collected in the 
following lemma. 



sfsR 

Lemma 6.5 For clause-sets F,F' £ CCS with F y- F' (k £ No, with resolution 

variable v and resolvent R) we have: 

1. F' is logically equivalent to F . 

2. var(F') = var(F). 

3. c(F') = c(F) - k, S(F') = 6(F) - k. 

4. yvd(F) > yvd(F'). 



5. 

6 . 



7. 

8 . 



F £ MU => F' £ MU. 

If k = 1 and F' £ MU, then exactly one of the following three possibilities 
holds: 



(a) S (F',R,v) is a partial saturation of F' ( recall Definition 3.6). 

(b) S (F',R,v) is a partial saturation of F' . 

(c) F £ MU. 

F £ SMU => F' £ SMU. 

F £ UTT & F' £ U1T. 



Proof: Parts [0, [| []] follow directly from the definition. Parts 10 hold since we 

strengthen two clauses into one, which on the other hand is logically equivalent to 
its parent clauses. Part | follows by trivial combinatorics. 

Now consider Part p . That the two possibilities for partial saturation exclude 
each other follows by Lemma 3.8 (and F' \ {f?} R). And that each possibility 

for partial saturation excludes F £ MU follows by definition. Finally, that the 
negation of the two partial saturation possibilities implies F £ MU follows again 
by Lemma 3.8. □ 



Part H of Lemma T5 handles a subtle source for errors: One could easily think 
that for F' £ MU a strict full subsumption extension yields another F £ MU, but 
this is not so, as there are three possible cases to be considered here, illustrated by 
the following examples: 
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Example 6.6 Consider F := {{i;, a}, {ii, a}, {«}, {u, a}}. So F - ^-> 1 F' for F' := 
{{a}, {u} , {u, a}}. We have F' £ MU, but F ^ MU, and indeed S {F',R,v) = 
{{a, v}, {«}, {v, a}} is a partial saturation of F' (while S(F',R,v) isn’t one). 



The condition on the resolution variable for strict full subsumption resolution 
(that it must not vanish) is exactly needed for Parts of Lemma 6.5. If this 
condition is dropped, then we speak of full subsumption resolution: 



Definition 6.7 full subsumption resolution is defined as strict full subsump- 
tion resolution, but now the resolution variable is allowed to vanish. If the resolution 
variable definitely vanishes, then we speak of if non-strict full subsumption res- 
olution. In the other direction we speak of full subsumption extension resp. 
non-strict full subsumption extension. 

So if F' is obtained from F by one step of non-strict full subsumption extension, 
then we have c(F') = c(F) + 1, n(F') = n(F) + 1 and S(F') = 6(F). 



Example 6.8 Considering the non- examples from Example 6. i : 

1. {{u},{n}} f sR y {_L}, but by full subsumption resolution we obtain {_L}. 

sfsR 

2. {{v, u>}, {v, w}, {i>, fa}} / > {{u, w}, {{«;}, {u, w;}}}, and the transition is 

also not possible by full subsumption resolution. 



3. {{v, u>}, { v , w}, {u, w}, {u}, {re}} is irreducible by full subsumption resolution. 

As follows from the characterisation of SMUs=i = URIT$=i in |^/ . a clause-set 
F £ CCS can be reduced by a series of non-strict full subsumption resolutions to 
{_L} iff F£ SMU s= 1 = UH1T5=i. 



6.2 Extensions to full clause-sets 

If we start with the full clause-sets A(V), then by strict full subsumption resolution 
we obtain exactly all unsatisfiablc hitting clause-sets: 



s fsR 

Lemma 6.9 If for some finite V C VA we have A(V) — — »* F, then F £ WHTT 
holds. And for F £ URIT we have A(vax(F)) S ^ R >* F . 



Proof: The first part follows by Lemma [b5|, Part [| (and A(V ) £ WHIT). And 
for the second part note, that if F £ URIT has a non-full clause, then an strict 
full subsumption extension step can be applied, where the result is still in URIT 
(again by Lemma 6.5, Part |c if F has only full clauses, then F = A(var(F))). □ 



Recall that in Example |6.6| we have seen, that strict full subsumption extension 
does not maintain minimal unsatisfiability in general. But from arbitrary mini- 
mally unsatisfiable F we can obtain A(var(E)), when we additionally allow partial 
saturation: 



Lemma 6.10 For F £ MU we can obtain A(var(F)) from F by a series of strict 
full subsumption extensions in combination with partial saturations. 

Proof: If F £ MU has a non-full clause, and if strict full subsumption extension 
can not be applied in order to obtain F' £ MU, then by Lemma |6.5| , Part ||, a 
partial saturation is possible. □ 



We obtain sharp upper bounds on deficiency and number of clauses in terms of 
the number of variables: 
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Corollary 6.11 For F & MU holds: 

1. 6(F) < 2 n ( F " > — n(F). 

2. c(F) < 2 n ( F \ 



In both cases we have equality iff F is full (i.e., F = A(v&r(F))). 

Proof: For Part [T] note that by Lemma |6.10 we can transform F into A(var(F) ) by 
a series of steps not decreasing the deficiency. Thus 6(F) < £04(var(F))) = 2 n ( F " > — 
n(F). For Part [|note c(F) = 6(F) +n(F) < 2 n ^ F ' > (by Part 0 ). For F = A(var(F)) 
these inequalities are indeed equalities. If we had 6(F) = 2 n ( F > — n(F ) for some non- 
full F, then some strict full subsumption extension must be possible, contradicting 
the upper bound of Part [I]. And if we have c(F) < 2 n ^ for some non-full F, then 
again some strict full subsumption extension must be possible, contradicting the 
upper bound of Part [|. □ 



We explicitly state the instructive reformulation, that the A n are the minimally 
unsatisfiable clause-sets of maximal deficiency for given number n of variables: 

Corollary 6.12 Consider m £ No and F £ MU n - m such that 6(F) is maximal. □ 
Then F = A(v&r(F)) . Thus the maximal deficiency for F £ MU n —m is 2 m — m 
(realised by A m £ MU n = m fl MU$= 2 m -m)- 

So for m = 0, 1,2, 3, 4, 5 , 6 variables the maximal deficiency of minimally unsatis- 
fiable clause-sets is 1, 1, 2, 5, 12, 27, 58; in general the deficiencies of the form 2 m — m 
are central for our investigations (note that the function to £ No H > 2 m — m £ N is 
monotonically increasing) . We are now able to determine the numbers of variables 
and numbers of clauses possible for minimally unsatisfiable clause-sets with a given 
deficiency: 



Theorem 6.13 For k £ N let o(k) £ No be the smallest n £ No with 2 n — n > k. 



1. {n(F) : F £ MUg=k} = {n £ No : n > o(k)}. 

2. {c(.F) : F £ MUs=k} = {n £ N : n > o(k) + k}. 

Pro of: Part follows by Part 0 so it remains to show Part |]. By Corollary 



5.11 we see that the left-hand side is a subset of the right-hand side. To show the 
other direction, we first note that increasing the number of variables by keeping the 
deficiency constant is achieved by one non-strict full subsumption extension step. 
It remains to show the existence of F £ MUs=k with n(F) = o(k). For k = 1 

we have F = {T}, so assume k > 1. Let F 0 := A(o(k) — 1) (so 6(F 0 ) = k — 1; 

note o(k) — 1 > 1). Add a variable by one step of non-strict full subsumption 
extension, obtaining F\ £ MUs = k-i with one new variable, and then take a clause 
in Fi without that new variable and perform one step of strict full subsumption 
extension (on that new variable), obtaining F 2 with n(Ff) = n(F\) = o(k) and 
6(F 2 ) = 6(Fi) + 1 = k. □ 

o(k) for k > 1 by definition is the smallest n > 0 with 5(A n ) > k , and by 
Theorem it is the smallest n > 0 such that there is F £ MUs=k with n(F) = n. 
We have o(l) = 0, o(2) = 2, o(3) = ■•• = o(5) = 3, 0 ( 6 ) = ••• = o(12) = 4 

and o(13) = • • ■ = o(27) = 5. Except for the first term, the sequence (o(k))k^n 

is sequence [http://oeis.org/A1035~86 in the “On-Line Encyclopedia of Integer 
Sequences” . 



13 )That is, F € MU, n(F) = m, and for all F' g MU with n(F') = m we have 5{F') < 5(F). 
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7 Non-Mersenne numbers 



In this section we study the function nM : N — > N via a recursive definition (Defini- 
tion 7.1; see Table j^). The understanding of this recursion is the underlying topic 
of this section. This recursion is naturally obtained from splitting on variables 
with minimum occurrence in minimally unsatisfiable clause-sets, and will be used 



in Theorem 8.3 later. The sequence nM is sequence ittp : //oeis . org/A062289 
the “On-Line Encyclopedia of Integer Sequences” : 



• It can be defined as the enumeration of those natural numbers containing “10” 
in their binary representation; in other words, exactly the numbers whose 
binary representation contains only l’s are skipped. 

• Thus the sequence leaves out exactly the number of the form 2 n — 1 for n £ N 
(that is, 1, 3, 7, 15, 31, . . . ), whence the name. 

• The sequence consists of arithmetic progressions of slope 1 and length 2 m — 1, 
m = 1,2,..., each such progression separated by an additional step of +1. 



fc 


1 


2 


3 


4 


5 •• 


• 11 


12 •• 


• 26 


27 •• 


• 57 


58 


nM(fc) 


2 


4 


5 


6 


8 


• 14 


16 ■■ 


• 30 


32 ■■ 


■ 62 


64 



Table 1: Values for nM(fc), fc £ {1, . . . , 58} 

The key deficiencies in Table [I] are the following two classes: 

1. The fc-values k = 1,2, 5, 12, 27, 58, . . . are the deficiencies k = 2 n — n of the 
clause-sets A n , n £ N, while the corresponding values nM(fc) = 2 k are the 
minimum variable-degree of the clause-sets A n (see Lemma 2.13), as explained 



in Subsection 1.3 



2. The fc-values 1, 4, 11, 26, 57, . . . are the positions just before these deficiencies, 



as also discussed in Subsection 1.3; we call them “jump positions”, since 
precisely at these positions the function value increases by 2 for the next 



argument (compare Definition 7.12). 



The recursion in Definition 7.1 is new, and so we can not use these character- 
isations, but must directly p rove the basic properties; the deficiencies fc = 2" — n 
will be handled in Corollary 7.23 , while the jump positions are handled in Lemma 
|7.2C|. Later we will obtain two further alternative characterisations of nM: 



• Combinatorial characterisations are obtained in Corollary 3.13 , where we will 
see that nM(fc) for fc £ N is the maximal min-var-degree for lean clause-sets 
or variable-minimal unsatisfiable clause-sets with deficiency fc. 



• In Subsection 13.2 we will develop a general recursion scheme, which has the 
function nM “built-in”, as shown in Theorem 13.15. 



Definition 7.1 For k £ N let nM(fc) := 2 if k = 1, while else 

nM(fc) := max min(2 • i, nM(fc — i + 1) + i). 

ie{2,...,fe} 



The intuition underlying Definition 7.1 of nM(fc), as later unfolded in Theorem |8.3| , 
is that we want to ge t an upper bound on the min-var-degree of an F £ MUs-k 
(recall Definition 2.11), and for that we consider a variable v £ var (F) of minimum 
var-degree, set it to 0, 1, and infer an upper bound on vdi?(u) from the two splitting 
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results. The index i runs over the possible literal-degrees of v (thus we have to 
maximise over it), where i actually is the maximum degree over both signs, and 
thus we can take the minimum with i + i for the var-degree. In the splitting results 
(v — > e) * F (e £ {0,1}) the deficiency is reduced by i — 1, since i occurrences 
(i.e., clauses) and one variable are lost, and we apply recursively the lower bound 
nM(& — (i — 1)), where then the i cancelled occurrences have to be re-added. 

Example 7.2 Computing nM(fc) for 2 < k < 5: 

1. nM(2) = min (2 • 2, nM(2 - 2 + 1) + 2) = min (4, 4) = 4. 

2. nM(3) = max(min(2 • 2, nM(3 — 2 + 1) + 2), nhn(2 • 3, nM(3 — 3 + 1) + 3)) = 
max(min(4, 6), min(6, 5)) = 5. 

3. nM(4) = max(min(2 • 2, nM(4 — 2 + 1) + 2), min(2 ■ 3, nM(4 — 3 + 1) + 3), min(2 • 
4, nM(4 — 4 + 1) + 4)) = max(min(4, 7), min(6, 7), min(8, 6)) = 6. 

4- nM(5) = max(min(2 • 2, nM(5 — 2 + 1) + 2), min(2 • 3, nM(5 — 3 + 1) + 3), min(2 • 
4, nM(5 — 4 + 1) + 4), min(2 • 5, nM(5 — 5 + 1) + 5) = max(min(4, 8), min(6, 8), 
min(8, 8), min(10, 7)) = 8. 

7.1 Basic properties 

We begin our investigations into nM(fc) by some simple bounds: 

Lemma 7.3 Consider k £ N. 

1. k + 1 < nM(fc) <2 ■ k for k £ N. 

2. For k > 2 we have nM(fc) > 4. 

Proof: The upper bound of Part |l] follows directly from the definition (by the min- 
component 2 i). The lower bounds follows by induction: nM(l) = 2 > 1 + 1, while 
for fc > 1 we have nM(fc) > min(2fc, nM(A: — k + 1) + k) = min(2fc, 2 + k) = k + 2. 
Part [l| follows by Part [j] and nM(2) =4. □ 

A basic tool for investigating sequences is the Delta-operator, which measures 
the differences in values between to neighbouring arguments: 

Definition 7.4 For a sequence a : N — > R and k £ N let A a(k) := a(k + 1) — a{k) 
he the step in the value of the sequence from k to k + 1 . 

A few obvious properties of this Delta-operator are as follows: 

1. A : R n — >• R. n is linear: A(A • a + p ■ b) = A • A (a) + p ■ A (b). 

2. a € R N is constant iff Act = (0). 

3. a is increasing iff Aa > 0, while a is strictly increasing iff Aa > 0. Here for 
sequences a, b : R N — >• K N of real numbers we use a < b V n £ N : a n < b n , 
and a < b Vn € N : a n < b n . 

The first key insight is, that the next number in the sequence of non-Mersenne 
numbers is obtained by adding 1 or 2 to the previous number: 

Lemma 7.5 For k £ N holds A nM(fc) £ {1,2}. 
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Proof: For k = 1 we get AnM(l) = 2. Now consider k >2. We have 



nM(/c + 1) = max(min(4, nM(fc) + 2), max min(2z, nM(fc — i + 2) + z)) = 

ie{3,...,fc+i} 

max min(2z, nM(fc — z + 2) + z) = 
ie{3,...,fc+l} 

max min(2(z + 1), nM(/c — (z + 1) + 2) + (z + 1)) = 

»€{2,...,k} 

max min(2z+2, nMffc— z+l)+z+l) = 1+ max min(2z+l, nM(fc— z+l)+z). 

i£{2,...,fc} i£{2,...,fc} 

Thus on the one hand we have nM(fc + 1) > 1 + niaxj e { 2 ....,fc} min(2z,nM(fc — z + 
1) T z) = l+nM(/c), and on the other hand nM(fc + 1) < 1 + max ie r 2 ,...,fc} min(2z + 
1, nM(fc — z + l)+z + l)=2 + nM(fc). □ 



Thus increasing the deficiency k by one increases nM(fc) at least by one: 



Corollary 7.6 nM : N — » N zs strictly increasing. 

And changing nM(cz + b) to nM(o) + b can not increase the value: 

Corollary 7.7 We have nM(a + b) > nM(a) + b for a £ N and b £ No, and thus 
nM(a — b) < nM(a) — b for b < a. 

Proof: We have nM(a + b) — nM(a) = ^ n M(a + z) > b ■ 1, whence the first 

inequality. Applying it yields nM(a — b) + b < nM(a — b + b) = nM(a). □ 



Instead of considering the maxi mum over k — 1 cases z £ {2, ... ,k} to compute 
nM(fc) (according to Definition 7T), we can now simplify the recursion to only one 
case i„M (k) £ {2, . . . , k }, and for that case also consi dera tion of the minimum is 
dispensable. i n M(fc) is the first index z in Definition where the minimum is 
attained by the nM-term, that is, where 2 z > nM(fc — z + 1) + z: 



Definition 7.8 For k G N, k > 2, let i n M(fc) G N be the smallest i G (2, . . . , k} 
with i > nM(£: — z + 1) (note that k > nM(A: — fc + 1) =2, and thus i n M(&) is 
well-defined). 



Example 7.9 We have 
1- inM(2) = 2. 

2. i n M(3) = 3, since nM(3 — 2 + 1) = 4, nM(3 — 3 + 1) = 2. 

3. i„M(4) = 4, since nM(4 — 3 + 1) = 4, nM(4 — 4 + 1) = 2. 

f. inM (3) = 4, since nM(5 — 3 + 1) = 5, nM(5 — 4 + 1) = 4. 

5. i n M(6) = 5, since nM(6 — 4 + 1) = 5, nM(6 — 5 + 1) = 4. 

As promised, from i n M(^) we can compute nM(fc) by one recursive call of nM: 



Lemma 7.10 For k G N, k > 2, we have: 

1. 0 < i„M {k) — nM(fc — i nM [k) + 1) < 2. 

2. Ai n M(&) G {0,1}. 

3. nM(fc) = nM(fc — i n M(A:) + 1) + i n M(fc). 
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Proof: For Part [j] we consider the sequence i i->- fk(i) '■= i — nM(/c — i + 1); 

this sequence starts with /&( 2) = 2 — nM(fc — 1) < 0, and finishes with fk(k) = 
k — nM(l) > 2, and i n M(fe) is the smallest i with fk{i) > 0. By Lemma [b5] 
we have A fk{i) = A i(i) — AnM(A: — i + 1) (z) G {1 + 1,1 + 2} = {2,3}. So for 
inM(fc)-nM(fc-i nM (A;) + l) = /fc(inM(A:)) by definition we have /fc(i n M(fc)) > 0, while 
/fc(inM(fe)) < 2 due to Afk(i n M{k)) < 3 (otherwise i n M(fc) wouldn’t be minimal). 

For Par t p| we consider the sequence k < gi(k) := i — nM(fc — * + 1). Again 
by Lemma f7A] we get A gi(k) G {-1,-2}. It follows immediately Ai n M(fc) > 0. 
Now assume Ai n M (k) > 1; thus —2 < gi nM (k)(k + 1) < 0, whence, as shown before, 
gi nM (k)+i{k + 1) > -2 + 2 = 0, and thus Ai nM (fc) = 1. 

For Part || we consider the sequence i h->- hk(i) := nM(fc — i + 1) + i; by Lemma 
P we have A hk(i) G { — 1 + 1, — 2 + 1} = {0,-1}. Thus, and by definition of 
inM(fe), we get nM(fe) = max(2-l, . . . , 2 ■ (i nM (fc) - 1), /i fe (i nM (fc))) = max(2i nM (fc) - 

2 , /i/c(i n M (&)))■ Finally /i fc (i nM (fc)) > 2 i nM (fc)-2 nM(fc-i nM (fc) + 1 ) + 2 > inM (k), 

which holds by Part □ 

We obtain an alternative, functional characterisation of i n M (k): 



Corollary 7.11 For k G N, k > 2 the value i n M(&) G {1, . . . , k} is uniquely char- 
acterised by the two inequalities 



inM(fc) > nM(fc - i n M (A:) + 1) 
inM (A:) < nM(fc - i nM (A;) + 2). 



Proof: As shown in the first part of the proof of Lemma 7. 1C, the sequence i i— >• 
fk(i) :=i — nM(fc — i + 1) is strictly increasing. □ 



7.2 Characterising the jumps 

After these preparations we are able to characterise the “jump positions”, which 
are defined as those k where the function nM increases by 2: 

Definition 7.12 Let J := {k G N : AnM(fc) = 2} be the set of jump positions. 

Thus A nM(fc) = 1 iff k (f J, and by Table [I] we see J = {1, 4, 11, 26, 57, . . .}. Note 
that nM(fc) = 1 + k + |{fc' € J : k' < k}\. It is useful to define two auxiliary 
functions: 



Definition 7.13 Let i'{k) := k — i n M(A:) + 1 G N for k G N, k > 2. And let 
h{k) := nM(i'(fc)) G N for k G N, k > 2. 

Some basic properties: 



1. We have A i'{k) = 1 — Ai n M(A;). 

2. Thus by Lemma 7.10 , Part |[ holds A i'(k) G {0, 1}. 

3. By Lemma [7.10 , Part |], we have nM(fc) = h(k) + i n M(k). 

4. Thus A h(k) = AnM(fc) — Ai n M(A:). 

5. By Lemmas L5 and |7.10| , Part |2| we get A h(k) G {0, 1, 2}. 

6. By Lemma 7.10 , Part |l| we have i n M(A:) — h(k) G {0, 1, 2}. 



7. By Corollary 7.11 we have h(k) = nM(i'(fc)) < i n M(A:) < nM(7'(fc) + 1). 
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It is instructive to consider initial values of the auxiliary functions in Table ^]; 
this table is constructed as follows: 



• The values for nM(fc) are from Table |l] (it would be possible to completely 
construct the whole table row by row, but we leave this as an exercise to the 
reader, once the section is completed). 

• The columns i ' , h duplicate the columns k 7 nM, but with repetitions. 

• Columns i' and i„M are connected via i„M +i' = k + 1. 



• Columns nM, i n M and h are connected via nM = i n M +h. 

• The values of column i„M are determined according to Corollary [7.11 by the 
condition h < i n M < h', where h'(k ) is the nM- value following h(k). 



k 


nM 


A nM 


inM 


A i n M 


i' 


Ai' 


h 


Ah 


inM —h 


1 


2 


2 


- 


- 


- 


- 


- 


- 


- 


2 


4 


1 


2 


1 


1 


0 


2 


0 


0 


3 


5 


1 


3 


1 


1 


0 


2 


0 


1 


4 


6 


2 


4 


0 


1 


1 


2 


2 


2 


5 


8 


1 


4 


1 


2 


0 


4 


0 


0 


6 


9 


1 


5 


0 


2 


1 


4 


1 


1 


7 


10 


1 


5 


1 


3 


0 


5 


0 


0 


8 


11 


1 


6 


0 


3 


1 


5 


1 


1 


9 


12 


1 


6 


1 


4 


0 


6 


0 


0 


10 


13 


1 


7 


1 


4 


0 


6 


0 


1 


11 


14 


2 


8 


0 


4 


1 


6 


2 


2 


12 


16 


1 


8 


1 


5 


0 


8 


0 


0 



Table 2: Values of auxiliary functions; underlined the jump positions 



First we show some further simple properties of the auxiliary functions: 



Lemma 7.14 Consider k > 2. 



1. If Ai n M(k) =0, then: 

(a) Ai nM (fc + 1) = 1. 

(b) WW - h{k) G {1,2}. 

(c) i n M (k + 1) = h(k + 1). 

2. A i n M(fc) = 1 <?=> A i'(k) = 0o A h(k) = 0. 

3. //Ai n M(fc) = 1, then: 

(a) k £ J. 

(b) i nM (fc) - h(k) G {0, 1}. 



Proof: For Part |la| assume Ai n M(fc + 1) = 0 (and thus A i'(k + 1) = 1 due to 
Ai' = 1 — A i n M)- Because of Ah = A nM — A i n M we obtain Ah(k + 1) > 1. Thus 
inM(fc) — inM(^T2) > /i(fc + 2) > h(fc+ 1) + 1 = nM(z , (fc-|- 1)) + 1 = nM(i , (fc) + l) + l, 
contradicting i n M(fc) A nM(z'(fc) + 1). For the remainder of Part |l] note A h(k) = 
AnM(fc) > 1. 

For Part lb note i n M(fc) = i n M(fc + 1) > h(k + 1) > h(k) + 1. 
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For Part 0 assume i n M(fc + 1) > h(k + 1). Thus i n M(&) = i n M(& + 1) > 
h(k + 1) + 1 > h[k) + 2, whence i n M {k) = h(k) + 2. If we would have A h(k) = 2, 
then i n M(fe) = i n M(fc + 1) > h{k + 1) = h(k) + 2; thus h(k + 1) = h(k) + 1. Now 
InM {k'j = h(k) + 2 = h(k + 1) + 1 = nM(i'(fc + 1)) + 1 = nM(i'(fe) + 1) + 1, a 
contradiction. 

Part U is obvious, and Part [ia| follows. Finally, Part [ft] follows by i n M(fc + 1) < 
h(k + 1) + 2 and i n M (k + 1) = i n M {k) + 1, while h{k + 1) = h(k) due to Part ||, 
whence i n M(&) < h(k) + 1. □ 

We obtain the main characterisation of the jump positions via the auxiliary 
functions: 



Theorem 7.15 For k > 2 the following conditions are equivalent: 

1. k G J 

2. A h(k) = 2 

3 . i nM (fc) = h(k) + 2 

4- Ai n M(fc — 1) = 1 and i n M (k — 1) = h(k — 1) + 1 

5. Ai n M(fc — 2) = Ai n M(fc — 1) = 1 (yielding various equivalent forms via Lemma 
7 .14, Part |[). 



Proof: Condition p] implies Condition |] due to Ai„M(fc) = 0 in case of k € J by 
Lemma [7.14 , Part [la[ Conditi on p| implies Condition [], since A h(k) = 2 implies 
Ai„M(fc) = 0, and so by Lemma [7. Id} Part [Tc|we have i n M(fc) = i n M(&+l) = h(k+l), 
while the assumpti on say s h(k+ 1) = h(k)+ 2. In turn Condition [] implies Condition 
0, since by Lemma 7.14 , Part 3b we get Ai n M(fc) = 0, and thus AnM(fc) = A h(k), 
where in case of A h(k) < 1 we would have h(k) + 2 = i n M(fc) < nM(t'(i) + 1) = 
nM (i'(k + 1)) = h(k + 1) < h(k) + 1. So now we can freely use the equivalence of 
these three conditions. 

Condition [] imp lies C ondition |~i|. since we have A i n M(fc) = 0, and thus Ai n M(fc — 
1) = 1 with Lemma |7 ~m| . Part 0 from which we furthermore get i n M(fc) = inM (k — 
1) + 1 and h{k— 1) = h(k), and so i n M(& — 1) = i n M(^) — 1 = h(k) + 1 = h(k— 1) + 1. 
Condition ^ implies C ondit ion [s|, since in case of A i n M(fc— 2) = 0 we had i„M(fc— 1) = 
h{k — 1) with Lemma |7.14| , Part [lc]. In turn Condition [ 5 ] implies Condition [| since 
inM(fc) = i n M (fc — 1) + 1 = i n M(fc — 2) + 2, while h(k) = h(k — 1) = h{k — 2), where 
by definition i n M(fc — 2) > h(k — 2) holds, whence i n M(fc) > h(k) + 2, which implies 
inM (^) = h(k) + 2. □ 



We understand now the shape of the four A-sequences: 



Corollary 7.16 By definition the sequence (AnM(fc))fc £ N is 1 except at the jump 
positions k, where it is 2. The other three A - sequences are shaped as follows: 

1. The sequence (Ai n M(fc))fceN,fc >2 consists of alternating 0,1’s except the two 
positions k — 2, k — 1 before a jump position k G J , where we have two con- 
secutive 1 ’s (while at the jump position we have 0). 

2. The sequence (A*' (fc))fc<=N,fc >2 consists of alternating 0, 1 ’s except two positions 
before a jump position k, where we have two consecutive 0 ’s. 

3. The sequence (Ah(k))k^n t k >2 consists of alternating 0, 1 ’s except two positions 
before a jump position k, where we have two consecutive 0 ’s, followed by a 2 
at the jump position k, which is followed by 0. 
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Proof: Part [i: By Lemma mg we have Ai n M(ft) = 0 => Ai n M(fc + 1) = 1, 
while by Theorem 7.15, Part 5| we have Ai n M(fc) = Ai n M(fc + l) = l=>ft + 2£,/, 
and by Lemma [7T4 , Part w e have k £ J => Ai n M(fc) = 0. 

Part 2 follows from Part [l| by A / = 1 — A i n M- 

Part 3: By Lemma 7.14, Part 0 the 0's in the sequence Ah are precisely the l’s 
in the sequence Ai n M, while a 0 of Ai n M translates into a 2 precisely at the jump 
positions by Theorem 7.15, Part The assertion follows now by Part |]. □ 



Especially instructive is understanding of the /-sequence: 



Corollary 7.17 The / -sequence (/(fc))feeN,fe>2 consists of doublets to, to for con- 
secutive m = 1 , 2 ,...,, except for k £ J\{1}, where we have at positions k— 2 , k— 1 , k 
a triplet to, to, to. These triplet-values occur exactly when m £ J . 



Proof: The doublet/triplet structure follows by Corollary 7.16 

consider a triplet /(ft — 2) = i'(k — 1) = i'(k) = to for ft 



Part @ 

J \ {1} , to £ 



Now 
N. By 



£ J Uif , m 

definition we have AnM(m) = A h(k) (due to h(k) = h(i'(k)) = nM (rn), h(k + 1) = 
nM(/(fc + 1)) = nM(/(ft) + 1) = nM(m + 1)). By Theorem 0 Part 0 we have 
thus have AnM(m) = 2, i.e., m £ J. The triplets do not leave out some jump- 
value in J, since for to £ J and for the last position k with /(ft) = m we have 
AnM(m) = A h(k). □ 



Example 7.18 We see now how we can built up the three columns k, nM, i 1 of Table 
0 together with an enumeration of the set J, which is built up as the set I (I is an 
initial part of J, which in the limit becomes J ): 

1. We start with the first row k := 1, initialising the value n of nM(l) = n 
to n := 2, while i n M is undefined; k = 1 is the first jump position, that is, 

/:-!'}• 

2. We go to the second row, k := k + 1. We update n := n + 2 and initialise the 
running value ofi'(k) = to to m := 1. 

3. We repeat the following steps ad infinitum: 

(a) If to £ I, then three rows are created: 

i. i n M (A) = n, i'(k ) = m, k := k + 1, n := n + 1 

ii. inM(fc) = n, i'(k ) = to, k := k + 1, n := n + 1 

in. inM(fc) = n, i'(k) = m, I := /U{fc}, k := k + 1, n := n + 2, 
to := to + 1 . 

(b) If to ^ I, then two rows are created: 

i. i n M (A) = n, i'{k) = m, k := k + 1, n := n + 1 

ii. i n M(fc) = n, i'{k) = m, k := k + 1, n := n + 1, to := m+ 1. 

Next we show that i'{k) for jump positions is the previous jump position: 



Lemma 7.19 For k £ J, k > 2, holds i'(k ) = max{fc' £ J : k' < k}. 



Proof: We prove the assertion by induction on k (regarding the enumeration of J). 
We have /( 4) = 1, and so the induction holds for k = 4, the smallest jump position 
k > 2. Now assume that the assertion holds for all elements ofJD{l,...,fc — 1}, 
where k > 4, and we have to show the assertion for k. By Corollary 7.17 we know 
i'{k) £ J, where 2 < /(ft) < k. Assume there is m £ J with /(ft) < to < k. By 
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7.14 we get 
□ 



induction hypothesis we get i'(k) < i'(m) < m. However by Lemma 
A i'{m) = 1, and thus i'{k) > i'(m) (since k > m). 



We obtain the promised characterisation of the jump positions: 



Lemma 7.20 We have J = {2 m+1 — (m + 1) — 1 : to £ N}. 

Proof: Let k m for to G N be the mth element of J; so the assertion is k m = 

2 m +i — to — 2. We have k\ = 4— 1 — 2 = 1 = min J; in the remainder assume 
to > 2. We prove the assertion by induction, in parallel with i n M(fcm) = 2 m+1 — 2 m . 
For to = 2 we have &2 = 8 — 2 — 2 = 4 = min J \ {1}, while i„M(4) is the smallest 
i £ {2,3,4} with i > nM(5 — i), which yields i„M(4) = 4 = 2 3 — 2 2 . Now we 
consider the induction step, from m — 1 to m. The induction hypothesis yields 
k m - 1 = 2 m — to — 1 and i n M(fcm-i) = 2 m — 2 m_1 . Lemma 7.19 yields i'(k m ) = k m - 1 , 
from which by i'(k m ) = k m — i n M (fc m ) + 1 follows 

km = 2 TO 2 + inM(^m)- 

Via a telescoping series we get 

inM(^m) — A i n M(^m 1) 1 h A inM^m — l) T inM^m— l)- 



By Corollary 7.16 , Part |l] the sequence A i nM (fc m _!), . . . , A i nM (fc m — 1) has the 
form 0,1, 0, 1, . . . , 0, 1, 1, and thus their sum has the value \{k m — fc m _i — 1) + 1. 
So we get 



InM(^m) — 2 (^m k m —l 1) “t“ 1 T inM^m — l) — 

i(2 m - TO - 2 + i nM (fc m ) - 2 m + TO + 1 - 1) + 1 + 2 m - 2 m_1 = 

2 InM {km) — 1 + 1 + 2 m — 2 m 1 , 

from which i n M(fcm) = 2 m+1 — 2 m follows. Finally k m = 2 m — to — 2 + 2 m+1 — 2 m = 
2 m+1 -to - 2. □ 



7.3 Applications 

Now the closed formula for nM(fc) can be proven: 

Theorem 7.21 For k £ N let fld(fc) := [ld(fc)J . Then we have for k £ N the 
equality nM(/c) = k + fld(/c + 1 + fld(fc + 1)). 

Proof: Let g(k ) := fld(fc + 1 +fld(fc+ 1)) and f(k) := k + g(k ) (so nM(fc) = f(k) is 
to be shown, for k > 1). We have /( 1) = l+fld(2 + fld(2)) = l + fld(3) = 2 = nM(l). 
We will now prove that the function g{k) changes values exactly at the transitions 
k k + 1 for k £ J, that is, for indices k = k m := 2 m+1 —to — 2 (using Lemma 
7.2({ ) with to £ N we have A g(k m ) = 1, while otherwise we have A g(k m ) = 0, from 
which the assertion follows (by the definition of J). 

We have g( 1) = 1 and g( 2) = 2. Now consider to £ N and k m + 1 < k < k m + 1 - 
We show g{k) = m + 1, which proves the claim. Note that g(k) is monotonically 
increasing. Now g(k) > g(k m + 1) = [ld(2 m+1 — to+ [ ld(2 m+1 — to)J )J = Lld(2 m+1 — 
?n + TO)J = to + 1 and g(k ) < g(k m +i ) = I ld(2 m+2 — ?n — 2+ I ld(2 m+2 — to — 2)J)J < 
|_ld(2 m+2 - m - 2 + to + 1)J = [ld(2 m+2 - 1)J = to + 1. □ 

As a result, we obtain very precise bounds for nM(fc): 
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Corollary 7.22 k + fld(/c + 1) < nM(&) < k + 1 + fld(fc) holds for k £ N. 

Proof: The lower bound follows trivially. The upper bound holds (with equality) 
for k < 2, so assume k > 3. We have to show g{k) = fld(fc+l+fld(A;+l)) < l+fld(fc), 
which follows from ld(fc + 1 + fid (A: + 1)) < 1 + ld(fc). Now ld(fc + 1 + fld(fc + 1)) < 
ld(fc + 1 + ld(fc + 1)) < ld(Jfe + *) = ! + ld(fc). □ 



Note that (A; + 1 +fld(/c)) — (fc + fld(/c + 1)) S {0, 1}, where this difference is zero 
iff k + 1 is a power of 2. Finally we can prove the already mentioned characteri- 
sation, which motivates the terminology of “non-Mersenne numbers” , namely that 
(nM (fc))„ g N enumerates N\ {2 n — 1 : n £ N}.^ F or tha t we consider the positions 
directly after the jump positions, which by Lemma 7.20 are the positions 2™ — n for 
n > 2. From that position on until the next jump position, which is 2 n+1 — n — 2, 
the nM- values increase constantly by 1 per step. So we just need to understand the 
values of nM(2 n — n), to understand all of nM, which is achieved as follows (note 
that (2 n+1 - n - 2) - (2 n - n) = 2 n - 2): 



Corollary 7.23 Consider n £ N, k := 2™ — n, and m £ No with m < 2™ — 1. 



1. nM(it) = 2". 

2. More generally for m < 2 n — 1 holds nM(fc + m) = 2 n + m. 

3. For m = 2 n — 1 we have k + m = 2 n+1 — (n + 1), and thus nM(fc + m) = 2" +1 . 



Proof: By Theorem [7.21, Part p] follows with nM(2 n — n) = 2 r ‘ 
1 + fld(2 n - n + 1)) = 2 n - n + fld(2 n - n + 1 + (n - 1)) = 2 n 
Part follows by Lemma 7.20[ and Part [| follows by Part |l]. 



- n + fld(2" — n + 
n + fld(2 n ) = 2". 

□ 



Besides nM(2 ra — n) = 2 n also the following special value is of importance: 



Corollary 7.24 For n € N, n > 2, we have nM(2" — n — 1) = 2" — 2. 

It is also useful to have simple formulas for the i n M(fc)-values around the jump 
positions: 



Corollary 7.25 For n £ N, n > 3 the values of i n M(2 n — n + to) are as follows, 
using p := 2 n_1 (where for m = —4 we need n > A): 



m 


-4 -3 —2 —1 0 1 


2 


3 


4 


InM 


p — 2 p — 2 p — 1 p p p+1 


p+1 


p + 2 


p + 2 



Proof: We have i„M(2" — n) = 2 n 1 by Corollary 7.11 



2 n ~ 1 > nM(2 n — n — 2 n ~ 1 + 1) = nM(2 n_1 — (n — 1)) = 2 n ~ 1 
2 n ~ 1 < nM(2" — n — 2 n ~ 1 + 2) = nM(2 n_1 — (n — 1) + 1) = 2 n ~ 1 + 1. 



The remaining 






□ 



We conclude with an alternative characterisation of the jump-set J: 



Corollary 7.26 For k £ N the following conditions are equivalent: 

1. nM(fc) < 2 • i n M (k) — 1. 

2. nM(fc) = 2 • i n M (A:) — 2. 

11 -'Note that we are not speaking of “non-Mersenne primes ”. 
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to — 2 for some to € N. 



3. k £ J , that is, k = 2 m+1 — 



Proof: If k £ J, then by Theorem 7.15 , Part [| we have nM(fc) = 2 • i n M (k) — 2 < 
2 • inM(fc) — 1- And if k ^ J, then by the same lemma we have i n M(fc) < h(k) + 1, 
and thus nM(fc) = h(k) + i n M (k) > 2 • i„M(fc) — 1. □ 



8 The min-var-degree upper bound for M.U 

In a sense the main auxiliary lemma of this report is the following statement on 
the deficiencies obtained when splitting a saturated minimally unsatisfiable clause- 
set, which receives its importance from the fa ct t hat every minimally unsatisfiable 
clause-set can be saturated (recall Subsection L2; this method was first applied in 
this context in fl9|). 



Lemma 8.1 Consider F £ SMlAs=k for k £ N and a variable v £ var (F) realising 
the minimum var-degree (i.e., vdp(v) = pvd(F )). Using mg := ldp^r?) and TOi := 
ldi?(t>) we have (v —> e) * F £ MUs=k-m e +i for e £ {0, 1}, where n((v — > e) * F) = 
n(F) — 1. Since minimally unsatisfiable clause-sets have deficiency at least one, we 
get m e < k. 

Proof: We have n((v — > e) * F) = n(F) — 1 since F contains no pure variable, 
while v realises the minimum of var-degrees. Thus S((v — >■ e) * F) = 6(F) — m e + 1, 
while (v — > e) * F £ JvUA by Lemma m Part [j]. □ 

Some explanations on this fundamental lemma: 



Example 8.2 If in the situation of Lemma 8.1 the value of m e is minimal, i.e., 
m E = 1, then we have 5((v — > e) * F) = 5(F) = k, while if m e is maximal, i.e., 
m e = k, then we have 5((v — > e) * F) = 1. The deficiency is strictly decreased for 
both splitting results iff v is non-singular. The point of v realising the minimum 
var-degree is, that we have control over the number of eliminated variables (namely 
no further variable is eliminated). If v £ var (F) is arbitrary, then 6((v — > e) * F) = 
k — m e + 1 + r, where r is the number of variables in F which occur only in the 
clauses containing v for e = 0 resp. in the clauses containing v for e = 1 . 

A cla ss o f concrete examples is given by the T rl £ SA4Ug_ 2 (n > 2; recall 
Example Iff.cj ), where for every v £ var(J r „) and e £ {0, 1} holds (v — > e) * T n £ 
AiUs = \ (since every literal of F n has degree 2). 



The definition of nM(fc) (recall Definition 7.1) matches the recursion-structure 
of Lemma 8.1, and we obtain an upper bound on the min-var-degree for minimally 



unsatisfiable clause-sets: 



Theorem 8.3 For all k £ N and F £ MU$<k we have /rvd (F) < nM(fc). More 
precisely, for n(F) > 0 there exists a variable v £ var (F) with vd^v) < nM(fc) and 
< k. 

Proof: The assertion is known for k = 1, so assume k > 1, and we apply induction 
on k. Assume 8(F) = k (due to k > 1 we have n(F) > 1). Saturate F and obtain F' . 
Consider a variable v £ var(F') realising the min-var-degree of F' . If vdjv(i;) = 2 
then we are done, so assume vd f'(v) > 3. Let i := max(ldpv(i>), ldjv(z))); so 
vd F /(v) < 2 i. W.l.o.g. assume that i = ld F /(r>). By Lemma |0] we get 2 < i < k. 
Applying the induction hypothesis and Lemma ^d] we obtain a variable w £ var (G) 
for G := (v — »• 1)*F with vdc(w) < nM(fc — z + 1). By definition we have vdF'(w) < 
vdc(w) + ld^/(n). Altogether fivd(F) < min(2i,nM(fc — « + !) + *) < nM(fc). □ 
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The upper bound on the minimum variable degree of Theorem 3.3 is not sharp, 
and will be further investigated from Section on. However the bound is attained 
for infinitely many deficiencies, and we demonstrate now that the jump positions 
(the set J; recall Definition 7.12) are such deficiencies. Moreover, to investigate the 
remaining deficiencies, we show that they always have at least two variables realising 
the bound (if the bound is attained at all); this will be used to prove Theorem 14.3. 
So we consider “extremal” F £ MUs=k with /xvd(F) = ^nM(fc), and we show that 
such extremal clause-sets have at least two different variables of minimal degree, 
if k J . First, it is useful to have a notation for the set of variables of minimal 
degree: 



Definition 8.4 For F £ CCS let var /xv d(i r ') C var(F) be the set of variables of 
minimal degree, that is, var^vd (F) := {v £ var (F) : vdp(v) = p,vd(F)}. 

Obviously var MV d (F) ^ 0 iff n(F) > 0, and var^vd (F) = var (F) holds iff F is 
variable-regular. 



Lemma 8.5 Consider k £ N. 



1. For k £ J and F £ AiUs=k with fivd(F) = nM(fc) we have |var /lv d(i 7 ')| > 2. 

2. For k £ J there is F £ WHXTs=k with /ivd(F) = nM(fc) and |var /Jv d(F)| = 1. 

Proof: First assume k £ J; we have to show the existence of different v,w £ 



var Alv d(F'). W.l.o.g. F is saturated. Consider v £ /xvd(F). By Corollary [7.2q we 
have nM(fc) > 2 ■ i n M (fc) — 1. Because of kip (v) + ldj’(v) = nM(fc) thus w.l.o.g. 
ei := ldp(u) > i n M(fc)- Let F' := (v — > 1) * F. So S(F') = k — e i + 1. Recall 
nM(fc) = nM(fc — i n M(fc) + 1) + i n M(&) (Lemma S Part ||), and thus nM(fc) > 
nM(fc — ei -(- 1) + e\. Since n(F) > 2, we can consider w £ var^vd {F'). We have 
vdF'(w) < nM(fc — ei -F 1) and vd^(w) = vdF'(w) + e\. Thus w £ var^vd (F). 

Now assume k £ J, that is, k = 2 m+1 — m — 2 for m > 1. For k = 1 we 
have the example {1,-1}, so assume k > 2. Then we have nM(fc) = 2 m+1 — 2. 
Now we obtain an example from A m+ \ by performing one strict full subsumption 
resolution: The resolution variable occurs 2 m+1 — 2 times, the other to — 1 variables 
occur 2 m+1 — 1 times. □ 



In Lemma 12.11 we formulate the sharpness of the upper bound of Theorem p.3| 



for these cases. 



9 The min-var-degree upper bound for CSAAf 



In this section we prove Theorem |9.8| , the upper bound nM(fc) on the min-var- 
degree for lean clause-sets of deficiency k, and the sharpness of this upper bound 
for any class be twee n VMU and CSAAf in Theorem |9.12| . The proof consist s in 
lifting Theorem 3.3 to the general case in Subsection |9.2|, while in Subsection |f).l| 



we introduce the aux iliary class SSV of clause-sets, where deficiency and surplus 
coincide; Lemma |[).5| there shows that unsatisfiable elements of SSV are variable- 
min imally unsatisfiable. Sharpness of the upper bound in considered in Subsection 
3 . 3 . 



9.1 Clause-sets with extremal surplus 

We consider the task of generalising Theorem |R3| to F £ CSAAf. Consider an 
arbitrary (multi-)clause-set F. Consider a set of variables 0 ^ V C var(F) realising 
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the surplus of F, i.e., such that 5(i r '[V’]) is minimal (recall Definition 2.19). If F\V\ 
would be satisfiable, then a satisfying assignment would give a non-trivial autarky 
for F. Assuming that F is lean thus yields that F[ V] must be unsatisfiable. So there 
exists a minimally unsatisfiable F' C F[V]. If now var(E') ^ var(.F|V]) = V would 
be the case, then we would loose control over the deficiency of F' . Fortunately 
this can not happen, as we will show in Lemma |9.5| . To understand this result, 
the following class of clause-sets with maximal surplus (relative to the deficiency) 
is important. 



Definition 9.1 Let the class SET) C CCS (“surplus equal deficiency”) consist of 
those clause-sets F £ CCS with cr(F) = 5(F). 

It seems the class SET crosses the classes considered in this report in interesting 
extremal cases. 



Example 9.2 Some basic examples: 



1. We have T £ SET and {_L} ^ SET , and more generally, for every F £ SET 
we have _L £ F. 



2. For a clause C £ CC we have <5({C}) = 1 — 1(71 and a({C}) = 0, and thus 
{C} £SET ^\C\ = 1. 

3. For F ■= {{1}, {2}} we have a(F) = 5(F) = 0, and thus F £ SET. However 
for the multi- clause- set F' := {2 * {1},{2}} we have 5(F r ) = 1, while still 
cr(F') = 0, and thus F' SET . 



4- 

5. 

6 . 

7. 

8 . 

9. 

10 . 



Another example for F £ SET with 5(F) = 0 is F := {{1, 2}, { — 1, 2}}. 
A n £ SET for n > 1 (Example 2.20) . 



MUs=i \ {{_L}} C SET (since for F £ MU \ {{_L}} holds a(F) > 1). 
T n £ SET for n > 2 (Example \ 3.0) . 



For F := {{1, 2, 3}, {1, 2, -3}, {1, -2}, {-1, 2}, (-1, -2}} we have F £ MU 
with 5(F) = 2, but a (F) = 1, and thus F SET. 



In Defin ition 10 . 4 we introduce the subclass MCC1Z C SET fl SAT, and 
Example IOC/ shows elements of this class. 



See also Example 10.11 and Question 10. Ik. 



Finally we note that F £ SET if F' £ SET , where F' is the multi-clause-set 
obtained from F by forgetting all signs of the literals, i.e., replacing clauses C £ F 
by var((7) (since 5(F') = 5(F) and cr(F') = cr(F)). 



The corresponding class SET of nmlti-clause-sets is not invariant under multi- 
plicities; consider a multi-clause-set F and the underlying clause-set F': 

1. If F' £ SET, then in general we do not have F £ SET (Example |9.2| ). 

2. However in general holds F £ SET => F' £ SET, since if we would have 
F' SET , then a(F') < 5(F'), and adding a duplicated clause to a multi- 
clause-set increases 5 by +1, while a is at most increased by +1 (it may also 
stay unchanged). 

A simple but instructive equivalent formulation of SET is as follows, which also 
yields a stronger corollary than the above “F £ SET => F 1 £ SET ” : 
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Lemma 9.3 For a multi-clause-set F we have F £ S£T> if and only if for all 
0 C V C var (F) and for the sub-multi-clause-set F v < F consisting of all C £ F 
with var(C) C V (with the same multiplicities) we have c(F' v ) < |V|. 

Proof: For arbitrary F £ CCS and every icFC var(F) we have 

c(P[P])+c(P var(F) \ v ) = c(F), 

and thus we get, using V' := var (F) \ V: 

5(F[y]) > 6(F) & c(F[P]) - \V\ > c(F) - n(F ) 

c(F ) - c (F val ^\ v ) - \V\ > c(F) - n(F) & c(F v ') < \V'\. 

These V' run through all |Cf'c var(F). □ 

We remark that c(F 0 ) < 0 O _L F. We obtain as an immediate corollary, 
that decreasing multiplicities in F £ S£V does not leave this classes (even if the 
multiplicity drops to zero): 

Corollary 9.4 For F £ S£V and F' < F we have F' £ S£V . 

Unsatisfiable elements of S£ V have a strong structure: 

Lemma 9.5 S£D HUS AT CVMU fand thus S£V Ci USAT C VMM ). 

Proof: Consider F £ S£V 0 US AT, and assume there is an unsatisfiable F' C F 
with var(F') C var(F); consider a minimally unsatisfiable sub-clause-set F" C F' . 
By definition we have for V := var(F) \ var (F") ^ 0: 

6(F") = c(F") - n(F") = c(F ") - (n(F) - n(F[V}) < 

(c(F) - c(F[V])) - (n(F) - n(F[V]) = 6(F) - 6(F[V ]) < a(F) - a(F) = 0 

contradicting 6(F") > 1 (since F" £ MU). Thus we have S£V HUS AT C VMU. 
An example of F £ MU \ S£ V is given in (Example |9.2[ Finally consider F £ 
S£V (~1 US AT : thus for the underlying clause-set F' holds F' £ S£V n US AT, 
whence F' £ VMU, and thus F £ VMU . □ 



We conclude this subsection by considering the complexity of SAT decision for 
F £ S£T>s=k for parameter k £ N. By Lemma 9.5 we could use Theorem 4.7, how- 
ever we have < 5£T>s=k C MC£AAf, and thus we can apply the fpt-result discussed 
in Example [2.18, and thus SAT decision for inputs in S£T>s=k is fpt in k. 



Question 9.6 Can SAT decision for S£T> be done in polynomial time? If so, can 
we also find a satisfying assignment quickly? 



9.2 The generalised upper bound 



Back to the main task, the central lemma utilises Lemma 9.5 to show that from 
extremal F\V\ we obtain variables of low degree for F itself: 



Lemma 9.7 Consider a multi- clause- set F and 0CkC var (F) such that F\V\ is 
unsatisfiable and a(F[V ]) = <S(F , [V]) > 1 (whence F\V] £ S£V fl MC£AJ\f ). Then 
there exists v £ V with vdf(v) < nM(<5(F[P])) and ld_p(u), ldf(jj) < 5(F[yj). 
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Proof: Let F' := F[F] and consider some minimally unsatisfiable F" C F' . By 
Lemma pTs] we have var (F") = var(F'). So we get 6(F") = S(F') — ( c(F' ) — c(F")). 
By Theorem 8.3 there is v £ var (F") with 

vd F «(t>) < nM (6(F")) = nM (6(F') - (c(F') - c(F "))) < 

nM(<5(F')) - (c(F') - c(F ")) 

and ld F //(u), ld F »(u) < 6(F") = 6(F') — (c(F') — c(F")). Finally we have vd F (i>) < 
vd F "(w) + (c(F') — c(F")) (note that all occurrences of v in F are also in F'), and 
similarly for the literal degrees. □ 

We are ready to show the generalisation and strengthening of Theorem |8.3| : 

Theorem 9.8 We have pvd(F) < nM(cr(F)) for a lean multi-clause-set F with 
n(F ) > 0. More precisely , there exists a variable v £ var (F) with vd F (i>) < 
nM(cr(F)) and ld F (n), ld F (F) < u(F). 

Proof: Recall that F is a lean multi-clause-set with n(F) > 0, and we have to show 
the existence of a variable v with vd F (w) < nM(er(F)) and lcl F (n), ld F (v) < a(F). 

Consider 1/fC var(F) with <5(F[P]) = cr(F), and let F' := F[V}. F' is 
unsatisfiable, since F is lean. Because of S(F') = <r(F) we have S(F') = cr(F'). So 
we can apply Lemma 9.7. □ 

Since for a variable v £ var (F) for any F £ CCS holds 

^(F[{u}]) = vd F (u) — 1 > fj,vd(F ) — 1 > <j(F), 

and the surplus is a lower bound for the deficiency, we get: 

Corollary 9.9 For a lean multi-clause-set F, n(F) > 0, we have 

<j(F) + 1 < iavd(F) < nM(<r(F)) < a (F) + 1 + fld(<j(F)) 
livd(F) < nM(<5(F)) < 6(F) + 1 + fld(5(F)). 

That the bounds from Corollary |9.9| are sharp in general, is shown by the fol- 
lowing examples. 

Example 9.10 First we consider any lean clause-set f / T, and perform a non- 
strict full subsumption extension F ^ F' . Obviously F' is lean as well (with S(F') = 
6(F)). Then we have pvd(F') = 2 and <j(F') = 1, and thus 

2 = a(F') + 1 = fivd(F') = nM(a(F')) = a(F) + 1 + fld(cr(F)), 

while S(F') is unbounded. This construction will be taken up again in Lemma 10.1!\ 
Now we turn to the 6-upper bounds. For n > 2 consider A n . We have a(A n ) = 
6(A n ) = 2™ — n by Example 2.2C. Thus here the inequalities of Corollary \0. f| are 



2 n — n + 1 — (j(A n ) + 1 < 

2 n = /xvd (A n ) = nM(<5(A„)) = 5(A n ) + 1 + fld(<5(A„)) 



(using Corollary 7 A 
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9.3 Sharpness of the bound for VM.U 

We now show that for every deficiency k there are variable-minimally unsatisfiable 
clause-sets where the min-var degree is nM(fc) (strengthening Example 9.10 ). The 
examples are obtained as follows: 

Lemma 9.11 For a clause-set F £ CCS, T ^ F and n(F ) > 0, with at least one 
full clause, consider the following construction of F' £ CCS: 

1. Let C be a full clause of F. 



2. Let F" be a full singular unit- extension of F (recall Definition 5.15). 

3. Let F' := F" U{C} . 

We have the following properties: 

1. u(F') = cr(F) + 1, 5(F') = 5(F) + 1, fivd(F') = /ivd(F) + 1. 

2. F £ S£V => F' £ SSV. 

3. F £ US AT => F' £ US AT. 



Proof: With Lemma 5.17 we get a(F") = cr(F), 5(F") = 5(F), fivd(F ") = 

livd(F). Obviously 5(F') = 5(F") + 1. Let var (F") \ var (F) = {v}. To see 
jjvd(F') = /ivd(F") + l, we note that for w £ var (F) we have vdjv(to) = vd_F »(w) + 
1, while vdF'( ,(; ) = c(F') — 1 = c(F") > vd pn(w) + 1. 

To prove a(F') = cr(F") + 1, we consider 0 C V C var(F') = var(F"). If 
v £ V, then <5(F'[y]) = <5(W"[W]) + 1, since C is full for F. If V = {r}, then 
5(F'[V]) = 5(F"[V}) = c(F") > a(F") + 1. Finally, if V D {u}, then 5(F'[V]) = 
c(F') - \V\ > S(F') = 5(F") + 1 > a(F") + 1. 

The implication F £ SSV =>■ F' £ SSV follows now by definition of SSV, and 
F £ US AT => F' £ US AT is trivial. □ 



With the construction of Lemma 9.11 we now show that the general upper bound 
on the min-var degree of lean clause-sets is tight for variable-minimally unsatisfiable 
clause-sets: 



Theorem 9.12 For a class VMU D SSV C C C CSAAf and k £ N we have 
pvd(Cs=k) = nM(fc). 



Proof: By Theorem 9.8 it remains to show the lower bound pwd(VJvlU$=k H 

SSV) > nM(fc). For deficiencies k = 2 n — n, n £ N we have nM(fc) = 2 n , and 
thus A n serves as lower bound example (as sh own in Example HI) , wh ile until the 
next jump p ositio n we can use Lemma 9.11 together with Lemma |9.5| , where due 
to Corollary 7.23 in this range also nM increases only by 1 for k k + 1. □ 



Using Lemma 4.3, we can now determine the min-var degrees for the classes 
CSAAf and VMU, separated into layers via deficiency or surplus: 



Corollary 9.13 For k £ N holds nM(fc) = p-vd(C£ A\Ts=k) = /rvd (CSAAf a =k) = 
livd(VMU s =k) = p.vd(VMU a=k )- 
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10 Algorithmic implications 



In Subsections |10.1|, |1 0 . 2| we consider the algorithmic implications of Theorem 9.8. 
First in Theorem 10. 2| we show that via an autarky-reduction every clause-set F £ 
CCS can be reduced to some F' C F, where F' fulfils the min-var-degree upper 
bound of Theorem |9.S| (although F' might not be lean). For this autarky- reduction 
we do not know whether we ca n efficiently compute a certificate, the autarky, and 
we d iscuss the Conjecture 10.3 , that efficient computation is possibl e, in Subsection 
10. 2|. We conclude with some remarks on the surplus in Subsection 10.3. 



10.1 Autarky reduction 



By Theorem |9.8| lean clause-sets fulfil a condition on the minimum variable-degree — 
if that condition is not fulfilled, then there exists an autarky. In this section we try 
to pinpoint these autarkies. We consider a vast generalisation of lean clause-sets, 
namely matching-lean clause-sets (recall Subsection |2.6| , especially that a multi- 
clause-set F with n(F) > 0 is matching-lean iff a(F) > 1). It is also useful to note 
here the observation that a (multi-)clause-set F has a non-trivial autarky (is not 
lean) iff there is 0 C V C var(F) such that F[V] is satisfiable, and the corresponding 
autarky reduction of F removes all clauses containing some variable of V ; note that 
to perform this autarky reduction the autarky itself (the satisfying assignment for 
F[V]) is not needed, only its set V of variables. 

We obtain a sufficient criterion for the existence of a non-trivial autarky by 
considering the converse of Theorem 9.8: 



Lemma 10.1 Consider a matching-lean multi- clause- set F with n(F) > 0. If we 
have pvd(F) > nM(cr(F)), then for all F' := F[V] with 0 C V C var(F) and 
i5(F[k]) = cr(F) we have: 

1. S(F') = a(F') = a(F) (so F' e STD n MCE AN) . 

2. /rvd(F') > nM(<r(F')). 

3. F' € SAT (thus there is p G TASS with var(y>) = V and p * F' = T; this p 
is a non-trivial autarky for F ). 

Proof: Part [j] follows by definitions. For Part |] note that /ivd(F') < nM(tr(F')) 
implies pvd (F) < pvd( F') < nM(cr(F)) contradicting the assumption. And Part 
follows now by Lemma |9 . 7|. □ 



To better understand the background, we recall two fundamental facts regarding 
the surplus a(F) for multi- clause- set F with n(F) > 0: 

1. <j(F) together with some 0 C V C var(F) with cr (F) = <5(F[V]) can be 
computed in polynomial time (see Subsection 11.1 in |p7|). 

2. If o(F) < 0, then one can compute a non-trivial matching autarky for F in 
polynomial time (see Section 7 in |5l| or Section 9 in |57|). 

We see now that we can reach the conclusion of Theorem |9.8| for arbitrary 
inputs F in polynomial time, via some autarky reduction (maintaining satisfiability- 
equivalence) : 



Theorem 10.2 Consider a multi-clause-set F. We can find in polynomial time a 
sub-clause-set F' C F such that: 

1. There exists an autarky p for F with F' = ip * F . 
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2. If n(F') > 0, then cr(F’) > 1 and pLvd(F') < nM(cr(.F')). 



Proof: First F is reduced to the underlying clause-set (and all further compu- 

tations only handle clause-sets), and if JL € F, then we reduce F to {_L} and 
are finished. Otherwise, the reduction process of F yielding the final F' consists 
now of a loop of two steps: First eliminate matching-autarkies (i.e. , compute the 
matching-lean kernel), such that we reach ofF) > 1. Then apply the autarky- 
reduction according to Part of Lemma 10.1 (removing all clauses containing a 
variable of V) in case of pvd(F) > nM(cr(F)). This loop is aborted if T is reached 
or the criterions no longer applied. All autarkies are composed together (as shown 
in |5(J, also in general the composition of autarkies is again an autarky), yielding 
the final ip. □ 



In Theorem 10.2 we can only show the existence of an autarky p for F with 
F' = tp * F, however we currently do not know how to compute it efficiently. We 
conjecture that it can be found in polynomial time: 



Conjecture 10.3 For F £ CCS a >\ there is a poly-time algorithm for computing a 
non-trivial autarky p for F in case of pvd(F) > nM(cr(F)). 

Note that we ask only to find some autarky p, not necessarily one given by Lemma 
10.1 (i.e., with var(y:) = V as in Part of Lemma 10.1). That this is enough follows 
by the fact that the number of variables is reduced by such a reduction, and this 
by some autarky: 



Lemma 10.4 If Conjecture 10. S is true, then for the algorithm from Theorem 10. 



which reduces a multi-clause-set F to some (satisfiability-equivalent) F' C F, we 
can also compute an autarky p for F with F' = tp * F in polynomial time. 



Proof: In the loop as given in the proof of T heore m 10.2, we can replace the 
autarky-reduction according to Part || of Lemma 10.1 by th e red uction F tp * F 
according to a (non-trivial) autarky as given by Conjecture 10.5 . □ 



In the subsequent Subsection |10.2| we discuss what we know about Conjecture 



10.2 On finding the autarky 



Consider a matching-lean multi-clause-set F with n(F) > 0, where Lemma 10.1 



is 



applicable (recall that we have cr{F) > 1), that is, we have pvd (F) > nM(u(F)). 
So we know that F has a non-trivial autarky. Conjecture 10. S| states that finding 
such a non-trivial autarky in this case can be done in polynomial time (recall that 
finding a non-trivial autarky in general is NP-complete, which was shown in [filf ). 

The task of actually finding the autarky can be considered as finding a satisfying 
assignment for the following class MCC1Z C SAT C MCEAN of satisfiable(!) multi- 
clause-sets F, obtained by considering all F\V] for minimal sets of variables V with 
S(F[V]) = a(F) (where “CR” stands for “critical”): 



Definition 10.5 Let NiCCTE be the class of clause-sets F fulfilling the following 
three conditions: 

1. F g MCE AN, i £ F, F ± T. 

2. For all 0 C V C var(F) holds 5(F[F]) > cr(F). 

3. /rvd (F) > nM(cr(F)). 
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(The definition of MCCTZ just uses F £ MCE AN instead.) 

The basic properties of this class are collected in the following lemma: 



Lemma 10.6 For F £ MCCTZ holds: 

1. 6(F) = <j(F) > 1 ( whence F £ S£V) . 

2. F £ SAT . 

Proof: Since F £ MCE AN and n(F) > 0, we have a(F) > 1. By i ^ F we get 
F = F[ var(J r )1, and thus a(F ) = <5(F[var(F)]) = 6(F), while F £ SAT follows by 
Lemma ITTul . □ 

The examples we know for elements of MCCTZ are as follows: 



Example 10.7 A simple example for F £ MCClZs=i D TLTT is given by 



F := {{1, 2}, {-1, 2, -3}, {-2, 3}, {1, -2, -3}}. 

We have 6(F) =4 — 3 = 1 and fxvd(F) = 3; for a(F) = 1 and Condition [| of 
Definition \M notice, that any two variables cover all four clauses, and thus the 
minimum of (5(F[V]) is only attained for V = var(F); finally by p,vd(F) = 3 > 
nM(l) = 2 we get F £ MCCTZ, while F £ TUT by definition (any two clauses have 
a clash). 

This example also shows that MCCTZ is not invariant under multiplicities: Ob- 
tain F' from F by duplicating the first clause. We still have F' £ MCE AN . but 
6(F') = 2, while <J(-F'[{3}]) = 2 as well, and thus F' ^ MCCTZ . So duplicating a 
clause can lead outside of MCCTZ . In the other direction, removing a duplication, 
we have the following simple (counter-) example: Let F := {2 * {1}}; trivially we 
have F £ MCCTZ , but for the underlying clause-set we have {{1}} ^ MCE AN, 
thus it is not in MCCTZ. 

A more general class of example is obtained by full clause-sets. Let F be a full 
clause-set and n := n(F), m := c(F). Then F £ MCCTZ iff n < m < 2": 

1. We have 6(F) = m — n and thus 6(F) > 1 m > n. 

2. Furthermore F £ SAT -O- m < 2" . 



3. For 0 C V C var (F) we have <5(F[E]) = to — \ V\. Thus cr(F) = 6(F), and 
Condition || of Definition 10.5 is fulfilled. 



f. It remains to show the condition on the min-var degree: We have /rvd(-F) = m, 
while nM(a(F)) = nM(m — n). By Theorem 7.21 we obtain nM(m — n) = 
to — n + fld(m — n + 1 + fld(TO — n + 1)). We obtain for n > 1: 



/ivd (F) > nM(cr(f ;1 )) <t=> to > to — n + fid (to — n + 1 + fld(m — n + 1)) <t=> 
fld(TO — n + 1 + fld(m — n + 1)) < n <= 

M(2 n - 1 - n + 1 + fld(2 n - l- n+l))<n-» 
fld(2 n -n + fld(2" - n)) = fld(2 n - n + (n - 1)) = fld(2 n - 1) < n. 



Finally we note that the class MCCTZ is invariant against changes of polarities 
of literal occurrences ( if clauses become equal in this way, then their multiplicities 
have to be added), and thus for example replacing all clauses C £ F £ MCCTZ 
by their positive forms, var(C), we obtain a positive (no complementations occur) 
multi-clause-set F' £ MCCTZ (with c(F') = c(F) and n(F') = n(F)). 
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The importance of A iCCS is, that it is sufficient to find a non-trivial autarky for 
this class of satisfiable clause-sets. In order to show this, we need to strengthen the 
polytime computation of a(F ): 

Lemma 10.8 For a multi- clause- set F with n(F) > 0 we can compute in polyno- 
mial time a minimal subset 0 C V C var (F) with <5(F[F]) = <r(F). 

Proof: Let V := var(F). Check whether there is v G var(F) with a(F[V \ {u}]) = 
cr(F) — if yes, then V := V \ {u} and repeat, if not, then V is the desired result. □ 



We are ready to show that M.CCS is really the “critical class” for the pr oblem 
of finding the witness-autarky underlying the reduction F F' of Theorem |l0.2 



Theorem 10.9 Consider F G CCS with a(F) > 1 and pvd(F) > nM(cr(F)). 

1. For every minimal subset 0 C V C var(F) with d(F[F]) = a(F) we have 
F[V] G MCC1Z . 

2. Thus we can compute in polytime some 0 C V C var(F) with F[V] G MCC1Z . 



3. So Conjecture 10. i is equivalent to the statement , that finding a non-trivial 
autarky for clause-sets in AiCClZ can be achieved in polynomial time. 

Proof: Part [j] follows with Lemma [10. 1| . Part follows from Part [I] with Lemma 
10. S| . Part U follows with Part || (note that every autarky for some F[V] yields an 
autarky for F). □ 



Since MCC1Z C S£T>, if both questions of Question l.£ have a yes-answer, then 
this would prove Conjecture 10.4 



10.3 Final remarks on the surplus 



It is instructive to investigate the precise relationship between minimum variable- 
degree and the surplus for lean clause-sets, which by Corollary h£ are indeed very 
close. Small values behave as follows: 



Lemma 10.10 Consider F G C£AAf\ {T} (so <j(F) > 1 and pvd(F) >2). 

1. <r(F) = 1 holds if and only if pvd (F) = 2 holds. 

2. /ivd(F) = 3 implies <j{F) = 2. 

3. cr(F) = 2 implies /ivd(F) G {3,4}. 

4- nvd(F) = 4 implies a(F) G {2,3}. 

Proof: First consider Part [I] If <r(F) = 1 (so n(F) > 0), then by Theorem |9.8| 
we have pvd (F) < nM(l) = 2, while in case of /ivd(F) = 1 there would be a 
matching autarky for F. If on the other hand pvd (F) = 2 holds, then by definition 
<t(F) <2 — 1 = 1, while er(F) > 1 holds since F is matching lean. For Part || note 
that due to cr(F) + 1 < /xvd(F) we have ct(F) < 2, and then the assertion follows 
by Part ||; Part s follows in the same way. Finally Part follows by Part and 
nM(2) = 4. □ 



For some examples we use F with 6(F) = u(F): 



Example 10.11 Examples for cases a (F) G {2,3} in Lemma 10. 1L: 
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1. An example for /u,vd(F) = 4 in Part [| with F £ WHIT r\S£T> is given by A 2 . 

2. For { {a, b, c}, {a, b, c}, {a, b, c}, {a, b, c}, {a, c}, {a, c} } £ UHTT PASS'D we 
have /ivd(P) = 4 and cr(P) = 3 (Partfy). 

Question 10.12 is there for every k € N an F £ WHIT D S£T> with <j(F ) = k 
and /xvd(P) = k + 1 ? 

As we have for MU the levels MUs=k for k = 1,2, . we can consider for 

C£AM the levels C£AM a =k for k = 1,2, However, while the levels MUs=k 

as well as C£ANs=k all are decidable in polynomial time, already the first level 
C£A\f a =i is NP-complete: 



Lemma 10.13 Consider the map E : CCS — > CCS , which has E( T) := T, while 
otherwise for F £ C£<S\{T} it chooses (by some rule — it doesn’t matter) a clause 
C £ F and a variable v £ VA \ var(P), and replaces C by GU{r},CU{w}; in 
other words , an non-strict full subsumption extension F ^ E(F) is performed , as 
in Example 9.1t\. Then we have for F £ CCS: 



1. F £ C£AM iff E(F) £ C£AN. 

2. F £ MU iff E(F) £ MU. 

3. <j{F) < 1. 

Thus C£AAf a =i is coNP-complete, while MU a =i is D p -complete. 



Proof: The properties of the map E are trivial. The completeness-properties 

follow with the coNP-completeness of C£AM (|^l|) and the D p -completeness of 
MU (|80)). □ 



With Lemma 10.13 we also get easy examples for minimally unsatisfiable clause- 
sets of arbitrary deficiency and surplus 1. 



11 Matching lean clause-sets 



In this section, which concludes our conside rati ons on generalisations (beyond MU), 
we consider the question whether Theorem |9.8| can incorporate non-lean clause-sets. 

We consider the large class MC£AN of matching lean clause-sets, which is 
natural, since a basic property of F £ MU used in the proof of Theorem T8 is 
6(F) > 1 for F T, and this actually holds for all F £ MC£AN . We will construct 
for arbitrary deficiency k £ N and K £ N clause-sets F £ MC£AM of deficiency 
k, where every variable occurs positively at least K times. Thus neither the upper 
bound max(ldF(u), ld_F(u)) < f(S(F)) nor ldir(v) + ldF(u) = vdF(v) < f(6(F)) for 
some chosen variable v and for any function / does hold for MC£AN . 

An example for F £ MC£AJTs=i with /tld(P) > 2 (the minimal literal degree; 
and thus fivd(F) > 4) is given in Section 5 in Q, displaying a “star- free” (thus 
satisfiable) clause-set F with deficiency 1. In Subsection 9.3 in J57j] it is shown that 
this clause-set is matching lean. “Star-freeness” in our context means, that there 
are no singular variables (occurring in one sign only once). Our simpler construction 
pushes the number of positive occurrences arbitrary high, but there are variables 
with only one negative occurrence (i.e., there are singular variables). 

For a finite set V of variables let M(V) C A(V) be the full clause-set over 
V containing all full clauses with at most one complementation; e.g. M({ 1,2}) = 
{{ 1 , 2 }, {- 1 , 2 }, { 1 ,- 2 }}: 
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1. Obviously n(M(V)) = |Vj, c(M(V)) = \V\ + 1 and 8(M(V)) = 1 holds. 



2. We have already seen that M(V ) G MLS AN (for T ^ F' C F C A(P) we 
have d(F') < (5(f), and thus a full clause-set F is matching lean iff 8(F) > 1). 

3. By definition we have ldjif (v) C^) = \V\ and ld^f(y)(v) = 1 for v GV. 

Lemma 11.1 For k G N and I\ G N there are F G MLSANs=k with F G US AT 
for k >2 such that for all variables v G var(F) we have ld^(u) > K . 

Proof: For k = 1 we can set F := M({v i, . . . , Vk})', so assume k > 2. Consider 
any clause-set G G MUs=k-i with n := n(G) > K , and let V := var(G). Consider 
a disjoint copy of V, that is a set V' of variables with F'flb = 0 and \V'\ = \V\, 
and consider two enumerations of the clauses M(V) = {Ci, . . . , G„+i}, M(V') = 
{C [, . . . , C' n+1 }. Now 

F:=Gd{C i dC':iG{l,... ) n+l}} 

has no matching autarky: If tp is a matching autarky for F, then var (ip) C V = 0 
since G is matching lean, whence var(</?)nP' = 0 since M(V') is matching lean, and 
thus tp must be trivial. Furthermore we have n(F ) = 2 n and c(F) = c(G) + n + 1, 
and thus 5(F) = c(G) + n + 1 — 2n = 8(G) + 1 = k. By definition for all variables 
v G var(P) we have ldfC^) > n. □ 

For k = 1 the examples of Lemma |1 1 . l| for K > 3 are necessarily satisfiable, since 
MCSANs^i Ci US AT = MUs=i- It remains the questions whether the singular 
variables can be eliminated: 

Question 11.2 ^4re there examples for deficiency k G N of F G MC£AAfs-k with 
p\d(F) >k + l ? 

1. The above mentioned star- free clause-sets shows that this is the case for k = 1. 

2. What about the stronger condition p\d(F ) > K for arbitrary K G N ? 



12 Lower bounds for the min-var-degree of M.U 

We now return to (boolean) minimally unsatisfiable clause-sets. By Theorem |S.3| 
we have p,vd(A4Us-k) < nM(fc) for all k G N. The task of precisely determining 
pvd(MUs = k) for all k seems a deep question, and is the subject of the remainder 
of this report. First we introduce a notation for the true bound on fivd(AiUs=k)' 

Definition 12.1 For k G N let /xnM(fe) := /rvd(A iUs=k) G N (the “minimum 
non-Mersenne number” for (deficiency) k). 

So by Theorem |S.3| we have /inM < nM. All our examples yielding lower bounds 
on /rnM(fc) are actually (unsatisfiable) hitting clause-sets, and thus we believe 



Conjecture 12.2 For all k G N holds /mM(fc) = pvd(U'HITs=k) ■ 



We will see in Theorem 14.3 that /mM ^ nM. We believe that /inM is a highly 
complicated function, but the true values deviate only at most by one from nM: 



Conjecture 12.3 For all k G N we have /inM(fc) > nM(fc) — 1. 



By Corollary 7.22 we get: 
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Lemma 12.4 If Conjecture 12. S holds, then k — 1 + fld(fc + 1) < /jnM(fc) < k + 
1 + fld(fc) holds for k £ N. 



Later in Lemma 13.1 we will see that /jriM : N — > N is monotonically increasing. 



While in Theorem 14.5 we will (implicitly) construct a correction function ji : N — > 
{0, 1} such that /iriM < nM — 71 , where we remark in the Conclusion (Section |l5|) 
that also /niM ^ nM — 71 holds. Note that Conjecture 12.3 says that there exists 
7 : N — > {0, 1} with pnM = nM— 7 , while for every 7 : N — ► {0, 1} the function 
nM — 7 is still monotonically increasing (by Lemma 7.5), and is thus a possible 
candidate. 



In Subsection 12.1 we provide a general method for obtaining lower bounds , via 
considering full clauses (while in Section [bi| we turn to improved upper bounds). 
Namely we introduce iAc(MUg = k) £ N for k £ N, the maximal number of full 
clauses in F £ MU$ =k . According to our numerical investigations the number of 
full clauses is very close to pnM. and indeed to nM(i): 

Conjecture 12.5 For all k £ N we have iAc(MU$ = k) > nM(fc) — 1. 



Conjecture 12.5| implies Conjecture 12.3 ; regarding AcfUHTTs^k ) , there might be 
unbounded gaps to iAc(MUg = k) ■ The smallest deficiency k with ufc(AiUg~k) = 
nM(fc) — 1 (and also ^fc(A lUg-k) = /unM(fc) — 1) is k = 3, as shown in Lemma 
12.16| (together with Theorem 14.3). We show for two infinite classes of deficiencies 
k that vidjUUITg^k) = /JiiM(fc) = nM(fc) holds (Lemmas |12.10| , 12.11). Actually, 
the main point here could be considered as (just) the equalities //liM(A') = nM(fe), 
for which in these two cases the proofs don’t need to consider full clauses, and so the 
general method for computing lower bounds on uic(MUg-k), with the beginnings 
developed in Subsection 12.2, is not applied here. However in future work we will 
employ this method more fully (see Subsection 15. 3|), and, more important for the 
report at hand, we need for the proof of Theorem 14. 3| (that fmM( 6 ) = nM( 6 ) — 1 = 
8 ) the fact vic(M.Ug—^) < 4, shown in Lemma 12.16|. 



12.1 Some precise values for the min-var-degree of A iU 

A general lower-bound method for pnM is provided by the number fc(F) of full 
clauses in a clause-set F. The supremum vic(JCUJs—k) of this number over all el- 
ements of JAld s=k for fixed k is an interesting quantity in its own right, but in 
this report we only touch on this subject, providing the bare minimum of informa- 
tion needed in our context. See Subsection 15.3 for an outlook on the interesting 
properties of this quantity. 



Definition 12.6 For a clause-set F £ CCS let fc(f) £ No be the number of full 
clauses, that is fc (F) := \{C £ F : var(C) = var(F)}|. And for a class C C CCS 
of clause-sets we define fc(C) := (fc(F) : F £ C} C No as the set of all possible 
numbers of full clauses, while ufc(C ) £ No U {+ 00 } is the supremum ofic(C). 

Some simple examples: 



Example 12.7 fc(T) = 0, fc({_L}) = 1 , and fc ( { { 1 } , ( — 1, 2}}) = 1 . While fc(0) = 
0, thus i/fc(0) = 0, and fc (CCS) = No, thus vic(CCS) = + 00 . 

By definition we have: 

Lemma 12.8 fc (F) < /rvd(E) holds for every F £ CCS (since every variable in F 
has degree at least fc (F)), and thus ^fc(C) < /xvd(C) for every C C CCS. 
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We obtain that for lean clause-sets (especially minimally unsatisfiable clause-sets) 
of fixed deficiency the number of full clauses is bounded: 



Corollary 12.9 A lean clause-set of deficiency k can have at most nM(fc) many 
full clauses; i.e., for all k £ N we have vic{C£ AN s=k) < nM(fe). 

Precise values for z/fc (MU$=k) = /unM(fc) we show for two infinite classes of 
dehcien cies. The simplest class are the deficiencies directly after the jumps (recall 
Lemma m , the deficiencies of the A n : 

Lemma 12.10 For n £ N and k := 2 n — n holds 



vfc(UmTs=k ) = vtc{MUs=k) = in,vd(UWTs=k) = /unM(fc) = nM(fc) = 2 n . 

Proof: We have v ic(A n ) = 2" (recall Lemma HI), and thus vicfUfiTTs=k) > 2 n , 
while by Corollary |7.23| we have nM(fc) = 2 n . □ 

Also for the jumps themselves (recall Definition 7. 12| and Lemma 7.20) the same 
conclusions hold, namely by Lemma pjq, Part 0 (and the proof) we have: 



Lemma 12.11 For all k £ J holds 



uic{1MLTs=k) = vic(MU 5 = k ) = n\&{UWT&=k) = /mM(fc) = nM (jfe). 



Note that for k £ J there is n £ N, n > 2, with k = 2 n — n — 1 and nM(fc) = 
2" — 2. The underlying method of Lemmas 12. 1C, 12.11 is simple (as we already 
have explained it in Subsection 1.3): start with A n and apply strict full subsumption 
resolution to full clauses. Zero steps have been used in Lemma 12.10, one step in 
Lemma 12.11, and one example for two steps will be seen in the proof of Theorem 
14.1. The further development of this method we have to leave for future work. 



12.2 On the number of full clauses 

We have a special interest in those F € A4U where the lower bound ic(F) meets the 
upper bound iavd(F). In this case this number must be even, and we obtain another 
F' £ MU by resolving on any variable realising the minimum variable degree: 

Lemma 12.12 Consider F £ MU with fc (F) = /rvd(T'). Then fc (F) is even. 

Proof: Consider v £ var (F) with vdi?(u) = fivd(F). The occurrences of v are now 
exactly in the full clauses of F. Every full clause C must be resolvable on v with 
another full clause D, yielding E := CoD, and thus the full clauses of F can be 
partitioned into pairs {i>} U E, {u} U E (disjoint unions) for fc ^ many clauses E 
(of length n(F) — 1; note that because of fullness for a given E the clauses C, D are 
uniquely determined up to order). □ 

Thus, if lower and upper bound match, they must be even numbers: 

Corollary 12.13 If vie (MU$=k) = nM(fe) or vfc(MUs=k) = ^vd(MUs=k) for 
some k £ N, then vic(MUs-k) is even. 

Another property of ic{MU$=k) related to evenness is that if to is a possible 
number of full clauses, then 2 to is a possible number for 6 = k + m — 1: 

Lemma 12.14 2 to £ ic(MUs=k+m—i) for k £ N and m £ {1, ... , nic(MUs=k)} ■ 
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Proof: Consider F G MUg=k with fc(F) = vic(MUs=k)- Choose m of the full 
clauses of F, and choose a new variable v var(F). Replace each of the chosen full 
clauses C G F by two clauses CU{v}, Clil{u} (one non-strict and to — 1 strict full 
subsumption extensions), obtaining F' . We have F' G M.lig=k+m- 1 and fc(F') = 
2 to . □ 



As a special case we obtain that 2, 4 are always possible for the number of full 
clauses (except for k = 1): 



Corollary 12.15 For k G N holds 2 G fc(Al£4 = fc), and if k > 2 f/ien 4 G 
fc (MU s = k ). 



Proof: We show the assertion by induction over k , using Lemma [l 2. 14 , as follows: 
We have {{1}, { — 1}} G MU$=i, so consider k > 2. We know 2 G fc(A4l&=fc_i), 
thus 4 G fc(A4Zl5 = fc_i + 2-i=fc)- And once we have any F G MUs=k with a full clause, 
we get F' G MUs=k with fc(F') = 2 by performing a non-st rict fu ll subsumption 
extension on that full clause (i.e. , F' = E(F) as in Lemma 10.13 , with C a full 
clause) . □ 



We now turn to the determination of vic(M.U$=k) for k = 1,2, 3. 



Lemma 12.16 We have: 



1. vic(MUs=\) = 2 = nM(l). 

2. vic{MU s = 2 ) = 4 = nM(2). 

3. vic{MU s = 3 ) =4 = nM(3) - 1. 



Proof: Part |]: Between two clauses of some F G A4U$=i there is at most one 



conflict, and thus there are at most two full clauses in F. while by Corollary 12.15 
we know uk(MUs=i) > 2. Part By Corollary 12.15 we have uic(A4U, 5=2) > 4 
by Corollary [12. 9| we have pic(MUs= 2 ) < 4. Part |3j: By Corollary 12.15| we have 
pic{MUs^a) > 4, by Corollary 12.13 we have ^fc(A4Z4 = 3) <4. □ 



13 A method for improving the mvd upper bound 
for MU 

We now present a framework for generalising the argumentation of Theorem |S.3| 
together with the analysis of the underlying recursion from Section [t|. The idea is 
as follows: 

1. We start with upper bounds pnM(fc) < ak for k = 1 collected in a 

“valid bounds- function” /. 

2. For deficiency p + 1 and an envisaged min-var-degree to we consider the set 
PP/(P+ 1,to) of “possible” degree-pairs of variables (the degrees of the pos- 
itive and negative literals) in an envisaged clause-set F G SMUs= p +i with 
pvd (F) = to. 

3. If pp f(p + 1>to) = 0 , then to is “inconsistent”, that is, impossible to realise 
(as shown in Theorem [13. 1C| ) , whence //nM (p + 1) < to. 

4. While in case of pp^(p + l,m) 7^ 0 there might exist such an F or not (the 
formal reasoning underlying the definition of ppf{p + 1, m) is not complete). 
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So here we generalise the approach of Section [?] for describing the function nM to a 
general recursion scheme, obtaining a general method for improved upper bounds. 
The applications in this report are as follows: 



In Theorem |13.15 we obtain an alternative description of nM(fc). 



In Section E we will first show that the smallest fc, where we don’t have 
equality, is k = 6, namely p.vd(MUs=e) = 8 = nM(6) — 1 (Theorem 14.3). 



By the general recursion scheme then follows from this improvement of the 
upper bound, that for all k = 2 m — m + 1 for m > 3 we have //vd (MUs=k) < 
nM(fc) — 1. This improved upper bound is denoted by by nMi : N — ► N 
(Theorem |4j). 



13.1 Analysing splitting-situations 

“Valid bounds-functions” shall be monotonically increasing — we know that nM 
is (strictly) monotonically increasing, and we show that //vd is also monotonically 
increasing (not strictly, as we will later see in Theorem |l4.3|) : 



Lemma 13.1 The map //nM is monotonically increasing (//nM (fc) < //nM(fc + 1) 
for k gN). 



Proof: For F G MUs=k, n(F) ^ 0, we can construct F' G MUs=k+i with 

//vd (F) < //vd(F') as follows: 



1. If F is full, then obtain a non-full F" g M Us=k with //vd (F) = //vd (F") by 
a full singular unit-extension (Lemma 5.17), and replace F by F" . 



2 . 



If F is not full, then perform a strict full subsumption extension (Lemma |6.5|) , 
obtaining the desired F' . □ 



We define now “valid bounds-functions” , which are sensible as upper bounds on 
//nM, and we also define how to obtain such a function from initial upper bounds 
//nM(fc) < ak for k = 1, . . . ,p: 



Definition 13.2 A valid bounds-function is a function f : N — > N U {+oo} 
fulfilling the following three conditions: 



1 ■ /( 1 ) = 2 . 

2. f is monotonically increasing (i.e., Vfc, k' G N : k < k' =>• /(fc) < /(fc')}. 

3. f(k) is an upper bound for the minimal-variable degree of minimally unsatis- 
fiable clause-sets of deficiency k (i.e., Vfc G N : /znM(fc) < f{k)). 



The set of all valid bounds-functions is denoted by VB C N N . And by VB* := {/ G 
; / < nM} we denote the set of valid bounds-functions (pointwise) less-or-equal 
than the non-Mersenne function. 

For ai,...,a p GN,pGN. such that a i = 2, <// < aj for i < j, and ai > //nM(i), 
we define [ai, . . . , a p ] as that f g VB with f(k) = ak for k g {1, . . . ,p}, while 
f(k) = oo for k > p. 



By Lemma 13.1, //nM is a valid bounds-function, namely the smallest possible one. 
By Theorem H and Corollary |7.6| also nM is a valid bounds-function. In Corol- 
lary 13.16 we will see, that the continuation of [ai, . . . , a p ] by just oo is harmless, 
since nM is automatically taken into account via the improvement of valid bounds- 
functions through the use of potential degree-pairs defined below. 
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Lemma 13.3 VB as well as VB* , together with <, is a complete lattice, where 
infima resp. suprema are given by pointwise minimum resp. pointwise supremum. 
The smallest elements of both lattices is /jnM, while the largest is [2] resp. nM. 

The following definition reflects the main method to analyse and improve a given 
upper bound / for /mM(fc), namely it determines the numerical possibilities com- 
patible with /: 



Definition 13.4 Consider k,m G N with k > 2 and m > 4, together with a valid 
bounds-function f . The set of potential (variable-) degree-pairs w.r.t. f for 
( deficiency ) k and (minimum variable- degree) m, denoted by pp^(fc,m), is the set 
of pairs (eo,ei) G N 2 fulfilling the following conditions: 

(i) e 0 ,ei > 2 

(ii) eo, ei < k 
(in) eo + ei = m 
(iv) eo < ei 

(v) Ve G {0, 1} : / (k - e E + 1) + e E > m. 



We set pp(fc, m) := pp MnM (fc, rn). 

The motivation for Definition |l3.d is to assume F G SMUs=k with pvd(F) = m 
and v G var MVC ](_F), and to determine the possible literal-degrees eo = ldi?(u), 
ei = ldF(u), “possible” in a formal sense, “e” stands for “eliminated clauses”, 
namely e E is the number of clauses eliminated by ( v — > e). The “high” values of m 
(for fixed k ) are of real interest; compare Lemma 13.7 . The basic properties of pp^ 
are as follows: 



1. For every valid / and k > 2 we have pp^(fc,4) = {(2,2)} and pp f(k,m) = 0 
for m > 2k. 



2. Discussion of the five conditions (i) - (v) in Definition 13.4: 



(i) Only non-singular variables are considered, since only in this way the 
deficiency strictly decreases. 

(ii) The deficiency of F e := (v — > e) * F is k e := k — e e + 1 (assuming we 
split on a variable with minimal degree), and we must have k e > 1. 

(iii) eo, ei are the literal-degrees of v, v, which sum up to the variable-degree 
m of v. 

(iv) W.l.o.g. we can restrict attention to such degree-pairs, since F plays a 
role only up to isomorphism, and thus one can flip the sign of v in F. 

(v) We have F s G MUs=k-e e +i (assuming F is saturated). And for w G 
var (F s ) we have vdF(u;) < vdF e (w)+e e . If for some e G {0, 1} we would 
have p,vd(A4Us=k-e e +i) + e s < m, then for w G var Atv( j(i 7, E ) we would 
have vd f{w) < vd F e {w) + e E < gvd(AilJs=k-e e +i) + e E < m, but by 
assumption on w we have vdF(w) > m. 



3. An important special ca se of pp^(fc, to) is pp nM (fc,m); we have pp(fc,m) C 
PPnivi(k; TO ) ( see Lemma 13. 8| for a generalisation). The main point in using 
functions / is that the precise values of /.tnM (k) might not be known. 



4. To compute pp^(fc, to) according to the definition, only the values f(k') for 
k' G {1, . . . , k — 1} are needed. 



70 



Example 13.5 Consider f := [nM(l), nM(2), nM(3)] = [2,4,5]. First we deter- 
mine pp^(4, 7): 



1. By Conditions (i) - (iv) only {(3,4)} remains. 



2. Now /( 4-3 + 1) + 3 = /( 2) + 3 = 7, but /(4 - 4 + 1) + 4 = /( 1) + 4 = 6 < 7. 

3. Thus pp^(4, 7) = 0. 



We will see in Theorem 13. 1C that from this we can conclude /mM(4) < 6 (there is 
no “formal” possibility to reach the min-var degree of 7 for deficiency 4}. Now we 
determine pp^(4, 6): 



1. By Conditions (i) - (iv), {(2, 4), (3, 3)} are the possibilities. 



2. Checking Condition (v) for (2,4): /( 4 — 2 + 1) + 2 = /( 3) + 2 = 7, /( 4 — 4 + 
1) + 4 = /(l) + 4 = 6. 

3. Checking Condition (v) for (3, 3): /( 4 — 3 + 1) + 3 = /( 2) + 3 = 7. 



^ Thus PP/ (4, 6) = {(2, 4), (3, 3)}. 

The intuitive meaning of this is, that a min-var- degree of 6 can not be excluded by 
this type of formal reasoning, and 6 is the first new value according to this reasoning 
for [2,4,5]. 



We invite the reader t o com pute the following special case of what we show later 
(in the proof of Theorem 14.6; it might also be useful to consider Table [ 2 J) : 



Example 13.6 pp nM (13, 17) = {(8, 9)}, while for any valid bounds-function f with 
f(k) = nM(fc) for k £ {1, . . . , 5} and /( 6) = nM(6) — 1 = 8 holds ppj(13, 17) = 0. 

If we have a potential degree-pair for m, then also for m! < m: 



Lemma 13.7 Consider k,m,m' £ N with k > 2 and 4 < m' < to, and consider a 
valid bounds-function f. If ppj(k,m) ^ 0, then also ppj(k,m') ^ 0. 

Proof: Consider (eo,ei) £ ppf{k,m). Consider any 2 < e' 0 < eo and 2 < e\ < e\ 
with e' 0 < e( and e' 0 + e[ = m' . Now f(k — e' + 1) + e' > f(k — e e + 1) + e' e = 
f(k — e £ + l) + e £ — e e +e' e > m — e e +e' e = m—(m—ee) + (m' —eh) = e^+ml — eh > m’ 
for e £ {0,1}, and thus (e^e}) £ ppj(fc,m') ^ 0. □ 



Using a smaller bou nds-fu nction can not yield more potential degree-pairs, as is 
obvious from Definition 13.4: 



Lemma 13.8 Consider k,m £N with k > 2, m > 4, and valid bounds-functions 
/ij/2 with fi < fi (pointwise). Then ppj 1 (fc,m) C PP f 2 (k,m). Especially for any 
valid bounds-function f holds pp (k,m) C pp f(k,m). 

Again directly by definition (using monotonicity of valid bounds functions) we 
get that increasing k while keeping m can not remove potential degree-pairs: 



Lemma 13.9 Consider k,m £ N with k > 2, m > 4, and a valid bounds-function 
f. Then ppj(fc, m) C pp^(fc + 1 , to). 

The main use of potential degree-pairs is to provide upper bounds on /rnM(fc): 

Theorem 13.10 Consider k,m £ N with k > 2, m > 4, and a valid bounds- 
function f. If pp f(k,m) = 0, then /inM(fc) < m. 



71 



Proof: Assume pwd{M.Us=k) > m. Then there is F £ SMU' s ^ k with /ivcl(F) > m 
(using Corollary |A5| ). Consider v £ var MVC j(-F); if ldi?(w) < ld_F(v) holds, then let 
e := (Id jr’(v), ldi?(ri)), while otherwise flip the components of this pair. Now we have 
e € pp(fc, fivd(F)) (using Remark |]to Definition |13.4| ), and thus pp^(fc,m) ^ 0 by 
Lemmas 13.7, 13.8, contradicting the assumption. □ 



13.2 Recursion on potential degree-pairs 

Via potential degree-pairs and Theorem |13.10| , we obtain a method for improving 
valid bounds-functions: 



Lemma 13.11 Consider f £ VB. We obtain f £ VB recursively as follows: 



1 . /'( 1 ) := 2 . 

2. For k > 1 consider the largest 4 < m < 2 k such that pp j,(k,m) ^ 0, using 
Remark Q to Definition 13. \ (that we only need f'(k ' ) for k' < k). 



3. Now f'(k) := min (m,f(k)). 



Proof: f'(k) is well-defined for k > p due to Remark |l] to Definition 13.4. That 
/' is valid follows by induction as follows. We have to show f'(k) < f(k + 1) and 
/inM(fc) < f'(k) for all k £ N. For k = 1 both properties are true by definition. 
And the induction step follows for monotonicity by Lemma 13. 9| . and for the upper- 
bound-condition by Theorem 13.10. □ 



The mapping / £ VB i— )• f £ V£> we call the “non-Mersenne operator”: 



Definition 13.12 For f £ VB let the f £ VB according to Lemma 13.1 (j be de- 
noted by NM(/) := /' (defined via “recursion on potential degree-pairs”); we call 
NM : VB — > VB the “non-Mersenne operator”. 



The basic properties of the non-Mersenne operator are that of a kernel operator , 
which are order-theoretic properties as follows: 



Lemma 13.13 The map NM : VB — > VB is a kernel operator of the complete 
lattice VB, that is, for all f,g£ VB holds: 

1. NM(/) < / (intensive) 

2. NM(NM(/)) = NM(/) (idempotent) 



3- f < g => NM (/) < NM(g) (monotonically increasing). 



Proof: Intensitivity follows by definition of NM (note that in Lemma 13.11 



we 



have defined f'(k) such that f' (k) < f(k) holds). Also idempotence follows directly 
from the definition in Lemma 13.11, namely that f{k) for k > 1 already uses the 
improved values f'(k') for k' < k. Monotonicity follows by Lemma 13. S|. □ 



By Lemma 13.13| we get that NM(/) for / £ VB is the supremum of the set of 
f < f with NM(/') = f. By Theorem 13.10 we get NM(/rnM) = /rnM. In order 
to show that the non-Mersenne operator at most reproduces nM, that is, for all 
/ £ VB holds NM(/) < nM, we need to provide potential degree-pairs for nM: 



Lemma 13.14 For k > 2 we 



have (recall Definition l.li\): 



1. (h(k),i nM (k)) £ pp nM (fc,nM(fc)). 
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2. pp nM (fc, nM(fe) + 1) = 0. 



Proof: Let m := nM(fc). For Part [l] let eo := h(k), e\ := i n M (A;); so we have to 
show (eo,ei) G pp nM (fc,m). Consider the conditions (i) - (v) in Definition 13.4 . We 
have eo > 2, s ince nM > 2 in general, and ei > 2 by Definition |7.8| . As shown in 
Corollary 7.11 we have e p < e i, where e± < k by definition. Furthermore we have 
eo + ei = m by Lemma 7. 1C , Part [| Altogether we have now shown conditions 
(i) - (iv), and it remains to show that nM(fc — e E + 1) + e E > m holds for both 
e G {0, 1}; for e = 1 we have equality, as already remarked, and it remains to show 
nM(fc — eg + 1) + eg > to, which is equivalent to 



nM(fc - e 0 + 1) > i„M(fc)- 



By Definition |7.8| of i n M {k) (as the smallest i) this is implied by nM(fc — eo + 1) > 
nM(A: — nM(fc — eo + 1) + 1). By the monotonicity of nM this is implied by eo < 
nM(fc — eo + 1), i.e. , nM(A: — i n M (fc) + 1) < nM(fc — e 0 + 1). Again by monotonicity, 
this is implied by i n M(fc) > eo, he., e i > eo, which we have already shown. 

For Part || we have to show pp nM (fc, m + 1) = 0. Assume that we have (eo, ei) G 
PPnM (^5 m + 1) according to Definition Thus we have nM(fc — ei + 1) + ei > 
m+1, where 2 < e\ < k. Because of eo+ei = m + 1 and eo < ei we get e\ > ^(m+1), 
whence min(2-ei, nM(fc- ei + l) + ei) > m+1, and thus nM(fc) > m+1 by Definition 



7.1. 



□ 



We obtain an alternative recursion for nM(fc) (recall Definition 7.1): 

Theorem 13.15 NM([2]) = NM(nM) = nM. 

Proof: By Definition 13. 12| and Lemma 13.14 we get NM([2]) = nM. Since NM is 
idempotent, we also get NM(nM) = nM. □ 



So the non-Mersenne operator yields nM in the worst-case: 

Corollary 13.16 NM : VB — > VS*, that is, for every f G VS holds NM(/) < nM. 



14 Strengthening of the mvd upper bound for M.U 



In this final section many techniques introduced in this report come together, and 
we give some initial sharpness results (considering small deficiencies), and some 
non-sharpness results in the form of improved bounds (improving nM for infinitely 
many deficiencies). In Subsection 14.1 we determine /rnM(fc) for 1 < k < 6 as values 
2, 4, 5, 6 , 8 , 8 , where the main achievement is Theorem 14.3, showing /jnM( 6 ) = 8 = 
nM( 6 ) — 1. Applying the non-Mersenne operator, we obtain the improved upper 
bound /mM(fc) < nMi(fc) in Subsection 14.2, where nMi is like nM, but with a 
duplication after the jump positions, that is, AnM(fc) = AnMi(fc) = 2 is followed 
by AnM^fc + 1 ) = AnM(fc + 1) — 1 = 0. 



14.1 Deficiencies 1, . . . , 6 

We now show that the first deficiency k, for which the bound p.nM(fc) < nM(fc) is 
not sharp, is k = 6 . First we show sharpness for the first five values: 



Theorem 14.1 For k G {1, . . . , 5} we have /rnM(fc) = /j,vd(WHITs=k) = nM(fc). 



Proof: We have to give examples s howing that the upper bound nM(fc) is attained 
for examples in WHIT'S— k)- Lemma 12.1Cl| covers deficiencies k = 1, 2, 5, namely 
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1. Ai G WHFTs=i has fc(Ai) = 2 = nM(l) (recall Example 

2. A 2 G WHITs = 2 has fc(A 2 ) = 4 = nM(2) (recall Example [TTT^ ) . 

3. A 3 g UmTs =5 has fc(A 3 ) = 8 = nM(5). 



Deficiency k = 4 is a jump position, and thus covered by Lemma 12.11, where the 
example is as follows: 

4. For F 4 := {{1, 2}, {-1, 2, 3}, {1, -2, 3}, (-1, -2, 3}, {-1, 2, -3}, {1, -2, -3}, 

{ — 1, —2, —3}} we have F 4 G U'HFTs= 4 with fc(F 4 ) = 6 = nM(4). 

The remaining case k = 3 we obtain via strict full subsumption resolution from F 4 : 

5. For F 3 := {{1, 2}, (-1, 3}, {1, -2, 3}, {-1, 2, -3}, {1, -2, -3}, {-1, -2, -3}} 

we have F 3 G U'hLITs= 3 with /ivd(F 3 ) = 5 = nM(3). □ 



We note that example F 3 in the proo f of Th eorem 14.1 shows vic{U r HXT^ 3 ) = 
v{c{MUs= 3 ) = 4 (together with Lemma 12.16| > Part |). In the sequel of this sub- 
section we consider k = 6 . A computation shows that there is only one potential 
degree-pair for the min-var-degree as given by nM( 6 ) = 9: 

Lemma 14.2 pp nM (6,9) = {(4,5)}. 



Proof: Conditions (i) - (iv) of Definition |13. j| yield pp nM ( 6 , 9) C {(3, 6 ), (4, 5)}. 
Condition (v) excludes (3,6), since we have /xvd(A4Wa=6-6+i) + 6 = 8^9, while 
(4, 5) fulfils this condition due to nM( 6 — 4+l)+4 = 5+4 > 9 and nM (6 — 5+l)+5 = 
4 + 5 >9. □ 



However, the potential degree-pair of Lemma 14. 2| actually can not be realised, 
and thus /mM( 6 ) < nM( 6 ): 

Theorem 14.3 vic{UUlT 4=e ) = vIc{MU^q) = fJLvd(UHlTs= 6 ) = /m M( 6 ) = 8 = 
nM( 6 ) - 1. 

Proof: ide(Z+HT75=6) > 8 is confirmed by the variable-clause matrix 



/+ + + - 
- + + + 
- + - + 
v+ - + + 



+ -\ 



+ - - 

- 0 0 

- 0 0 



(where unsatisfiability is given by 8-2 4 + 2- 2 2 = 1). 

Assume now that there exists F G MUs = q with /rvd(F) = 9. By Lemmas 
aa w.l.o.g. we can assume that F is saturated and non-singular. By Theorem 
3+| we know n(F) > 4. Consider v G var(F) with = 9. W.l.o.g. we 

assume ldir(w) > ldp’(T). By Lemma 14.2 we have Hf^) = 5, ld_F(T) = 4, and 
S(F 0 ) = 6- 4 + 1 = 3, S(F 4 ) = 6- 5 + 1 = 2. 

Let the 5 occurrences of v in F be Ci, . . . , C 5 G F, and let C[ := Ci \ {n}. And 
let the 4 occurrences of v in F be D 4l . . . , D 4 G F, and let D\ := Di \ {w}. Using 
G := {C G F :v £ var(C)} = F\ {Ci, . . . , C 5 , D 4 , . . . , £> 4 } we get 

F 0 = {C{,...,C'}CG 
F 4 = {D[,...,D' 4 }UG 



where c(Fq) = 5 + c(G) = c(F) — 4 and c(F 4 ) = 4 + c(G) = c(F) — 5. 
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Consider first Fp G MU$ = 3. We have /rvd(F 0 ) > 9—4 = 5, and thus /rvd(F 0 ) = 5 
(due to /xvd(-Fo) < nM(3) = 5). Every variable w G var(Fo) realising the min-var- 
degree of Fp has at least 9 occurrences in F, from which at most 4 are eliminated, 
and thus actually such variables have vdidw) = 9, and furthermore w G var (Di) 
for all i G {1, ... ,4}. By Lemma |8.5| , Part [l|, there exist two different variables 
u\,v ,2 G var(F 0 ) with vd_F 0 (wi) = 5 for i G {1,2}, and so we have \Di\ > 3 for all 
* e {l, - - - ,4}. 



Now consider F\ G We have /zvd(Fi) > 9 — 5 = 4, thus /xvd(Fi) = 4 

(due to /xvd(Fi) < nM(2) = 4), and thus by Lemma 5.13 F\ is non-singular iff F\ 
does not contain unit-clauses. If F\ would contain a unit-clause, then there would be 
a binary clause {u, x} G F, contradicting that all Di contain at least three literals. 
So -Fi is non-singular, and thus F\ is isomorphic to some J- m for some m > 2. So 
F\ is 4- variable-regular, where all the variables of F\ have at least 9 occurrences in 
F, and thus we have var(Fi) C var(C') for all i G {1, . . .,5}, which implies that 
actually var(C') = var(Fi) = var(F 0 ) holds. 

Coming back to the structure of Fp, we now know that Fp has five full clauses 
C \ , • • • , C'p , which contradicts Lemma 12.1C , Part ||. □ 



14.2 Sharpening the bound 



Based on recursion on potential degree-pairs, we can i mpro ve the upper bound 
nM(&) for ^vd{A4Us=k) for k > 6 (generalising Example 13. (i|): 



Definition 14.4 Let nMi : N — > N be defined as nMj := NM([2, 4, 5, 6, 8, 8]) (recall 
Definition 13.1i). 



By Lemma 13.11 together with Theorem 14.3 we get: 



Theorem 14.5 For all k G N we have /znM(fc) < nMi(fc). 

It remains to determine nMi numerically: 

Theorem 14.6 In Table [| we find the values of nMi(fc) for k < 30. We have 
nM]^ (A;) = nM(fc) for k ^ {2 m — m + 1 : m G N, m > 3}, while for k = 2 m — m + 1 
we have nM^/c) = nM(fc) — 1 = 2 m . 



k 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


nMi(fc) 


2 


4 


5 


6 


8 


8 


10 


11 


12 


13 


14 


16 


16 


18 


19 


k 


16 


17 


18 


19 


20 


21 


22 


23 


24 


25 


26 


27 


28 


29 


30 


nMi(fc) 


20 


21 


22 


23 


24 


25 


26 


27 


28 


29 


30 


32 


32 


34 


35 



Table 3: Values of nMi(fc) for k G {1, . . . , 30}, in bold the jump- values (i.e., k G J), 
and underlined the changed values compared to nM(fc); we see that directly after 
the jump we have stagnation, followed by a second jump. 

Proof: We show the formula for nMi(fc) via induction on k. Due to 2 3 — 3 + 1 = 6 
it holds for k < 6. So assume k > 7. We show by induction on k > 7 the following 
two properties (which imply the assertions of the theorem): 

1. For k = 2 m — m + 1, m > 4, we have 

(a) PP nMl (k, nM(fc)) = 0 and 

(b) and pp nMl (fc, nM(fc) -1)^0. 
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2. Otherwise pp nMi (fc, nM(fc)) ^ 0. 



By Lemma 13.13 we know nM[ < nM. 

Part 1. We consider k = 2 m — m + 1, m > 4. We have nM(A:) = 2 m + 1. 

Part (a). To show pp nMl (2 m — m + 1,2™ + 1) = 0, we assume (e 0 ,ei) £ 
pPnMi(2 m — m + l,2 m + 1). Thus we know eo,ei > 2, eo,ei < 2 m — m + 1, 
eo + ei = 2 m + 1, eo < ei, whence eo < 2 m_1 , and 



nMi(2 m - m + 1 - e e + 1) + e E > 2 m + 1 



(1) 



for both £ £ {0, 1}. 

Case (a.l). Assume eo < 2 m ~ 1 — 1, and thus e\ > 2 m_1 + 2. 

From (jl|) we get nM(2 m — m + 1 — e\ + 1) + e\ > 2 m + 1, where (using Corollary 

Wi)-- 



nM(2 m — m + 1 — ei + l) + ei > 2 m + 1 =+ 

nM(2 m - m + 1 - ( 2 m ~ 1 + 2) + 1) + 2 m ~ 1 



2 > 2 m + 1 +> 



iM(2 m ~ 1 -m)> 2 m ~ 1 - 1, 



where by Corollary 7.24 we have nM(2 m_1 — m) = nM(2 m_1 — (m — 1) — 1) = 
2 m_1 — 2, and we obtained a contradiction, finishing Case (a.l). 

Case (a. 2). It remains eo = 2 m ~ 1 . From (Q) we get nMi(2 m — m + 1 — eo + 1) + 
e 0 > 2 m +l, where 2 m -m+l-e 0 +l = 2 m -m+l-2 m “ 1 +l = 2 m " 1 -(m-l)+l, and 
thus by induction hypothesis we get nMj (2 m — to + 1 — eo + 1) + eo = 2 m ~ 1 + 2 m_1 = 
2 m , a contradiction. This concludes Part (a). 

Part (b). We show (2 m_1 , 2 m_1 ) £ pp nMl (k, nM(k) — 1).^ For this it remains 
to show nM!(2™ — to + 1 — 2 m ~ 1 + 1) + 2 m ~ 1 > 2 m , and indeed nM!(2 m — m + 
1 — 2 m ~ 1 + 1) = nM 1 (2 m_1 — (to — 1) + 1) = 2 m_1 by induction hypothesis. This 
concludes Part 1. 



Part 2. k ^ 2 m — m + 1 for any m > 4. We have to show pp nMi (k, nM(fc)) ^ 0. 

Part (a), k = 2 m — m + 2; thus nM(fc) = 2 m + 2. 

We have ( 2 m ~ 1 ,2 m - 1 + 2) £ pp nMi (fc,2 m + 2Q due to nMi(2 m - m + 2 - 
2 m ~ 1 + 1) + 2 m ~ 1 = nMi(2 m_1 - (to - 1) + 2) + 2™’ 1 = 2 m ~ 1 + 2 + 2 m ~ 1 and 
nMi(2 m - to + 2 - (2 m ~ 1 + 2) + 1) + 2 m ~ 1 + 2 = nMi ( 2 m ~ 1 - (to - 1)) + 2 m ~ l + 2 = 
2 m “i+2 m — 1 + 2. 

Part (b). k = 2 m — m + 3; thus nM(fc) = 2 m + 3. 

We have (2 m -\2 m ~ 1 + 3) £ pp nMl (k, 2 m + 30 due to nMi(2 m - to + 3 - 
2 m— 1 + 1) + 2 m ~ 1 = nMi(2 m_1 - (to - 1) + 3) + 2 m ~ 1 = 2 m ~ 1 + 3 + 2™- 1 and 
nMi(2 m - to + 3 - (2 m_1 + 3) + 1) + 2“- 1 + 3 = nMi (2 m ~ 1 - (to - 1)) + 2 m ~ l + 3 = 
2 m ~ i + 2 m— 1 + 3. 

For all remaining cases 



2 m - to + 4 < k < 2 m+1 - (to + 1) 



we show (eo,ei) := (h (k), i n M(fc)) G pp nMl (fc, nM(fc)), as in Lemma 13.14 , Part [[]. 
Recall Corollary 7.25 for the computation of i n M(fe). 

For the critical cond ition “nMi(fc— ei+l)+ei > nM(fc)” we recall fc—i n M(fc)+l = 
i'{k) (recall Definition 7.13|), where i'{k) is monotonically increasing. We want to 



15 lWe have (2 m 1 ,2 m *) = (h(k), i n M(k) — 1), but we don’t need this here. 
16 1 We have (2 ro - 1 ,2 m - 1 + 2) = (h(k) - l,i nM (fc) + 1). 

17 >We have (2" 1 - 1 , 2 m ~ 1 + 3) = (h(k) - 1, i nM (fc) + 1). 
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show that i'(k) is never equal to some argument where nMi T nM, and thus we 
need to check the upper and the lower bounds: 

For k = 2 m -m + 4 holds i nM (/e) = 2 m " 1 + 2, thus i'(k) = 2 m - m + 4 - (2 m ~ 1 + 
2) + 1 = 2 m_1 - to + 3 > 2 m ~ 1 - (to - 1) + 1. 

For k = 2 m+1 -(TO+l) holds i nM (fc) = 2 m , thus i'(k) = 2 m+1 -(rn+T)-2 m +\ = 
2 m — to < 2 m — to + 1 . 



So all critical condition s for ei are fulfilled, and it remains to check eo- 

Part (c). 2 m -TO + 4<fc< 2 m+1 - (m + 1) - 2. 

Here for the critical conditions “nMi(fc — eo + 1) + eo > nM(fc)” the values 
k—e o+l are never equal to some argument where nMj ^ nM, since A(k— h(k)+l) = 
1 — A h(k) > 0 for k < 2 m+1 — (to + 1) — 2 by Theorem 7.15 , and thus k — h(k ) + 1 
is monotonically increasing for the k - range we consider. 

Checking for the lower bound k = 2 m — to + 4: i n M (k) = 2 m_1 + 2, thus 
h(k) = nM(fc) — i„M(fc) = 2 m + 4 — 2 m_1 — 2 = 2 m_1 + 2, which is the same as 
inM(fc): and doesn’t need to be checked again. 

Checking for the upper bound k = 2 m+1 — (to + 1) — 2: i n M(&) = 2 m — 1, and 
thus h(k) = nM(fc) — i n M(&) = 2 m+1 — 3 — 2 m + 1 = 2 m — 2, and so we check 
2 m+1 - (to + 1) - 2 - (2 m - 2) + 1 = 2 m - to < 2 m - to + 1. 

Part (d). k = 2 m+1 — (to + 1) — 1; thus nM(fc) = 2 m+1 — 2. 

Now i n M(fc) = 2 m , and so h(k) = nM(fc) — i n M(fc) = 2 m — 2, and so the critical 
argument is 2 m+1 — (to + 1) — 1 — (2 m — 2) + 1 = 2 m — m + 1. It remains to check 
nM 1 (2 m - to + l) + 2 m -2> 2 m+1 - 2, and indeed nM^” - TO + l) + 2 m -2 = 
2 m + 2 m - 2. 

Part (e). k = 2 m+1 — (to + 1); thus nM(fc) = 2 m+1 . 

Now h(k) = 2 m = i n M (k), and no check is needed. □ 



It is instructive to note the new A- values explicitly: 



Corollary 14.7 For k E N holds AnMi(fc) £ (0, 1,2}, with 

1 . AnMi(fc) = 0 k = 2 m — m for some to £ N, to > 3. 

2. AnMi(fc) = 2 <S==> k = 2 m — m ± 1 for some to £ N, to > 3. 



15 Conclusion and open problems 



The main subject of this report can be seen in the study of fivd(Cs—k) for classes 
WHIT CCC MCE AN and k £ N, that is, the study of the maximal minimum 
variable-degree of classes of matching-lean clause-sets containing all minimally un- 
satisfiable clause-sets, parameterised by the deficiency. If C C CEAN ', then this 
quantity is bounded, and indeed we have shown /ivd(££ ANs-k) = nM(fc) (more 
generally this holds for every subclass of CEAN containing VMU). While for 
C = MCEAN this quantity is unbounded. For MU we have shown the improved 
bound [ivd(MUs=k ) < nMi(fc), where indeed also this bound is not sharp (as will 
be shown in |l8j; see Subsection 15.2) the question about the determination of 
/jnM(fc) = fjuvd{MUs = k) is a major open research question. 

For lean clause-sets we have shown the strengthened upper bound /ivd (F) < 
nM(cr(F')), and indeed for every clause-set F we can satisfiability-equivalently re- 
move some clauses in polynomial time such that this upper bound holds. 



15.1 Conjectures and questions 

We made the following conjectures: 
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1 . 



Conjecture L0.3: If a clause-set violates the upper bound on the min-var- 
degree for lean clause-sets, then it must have a non-trivial autarky. As we have 
seen, we can determine the set of variables involved, but the determination 
of the autarky itself is open — the conjecture states that there is a poly- 
time algorithm for computing such an autarky. See Subsection 10.2 for more 
information on this topic. 



2 . 



3. 



Conjecture 12.2 : the maximum min-var-degree for unsatisfiable hitting clause- 
sets is the sa me as for the larger class of minimally unsatisfiable clause-sets. 
In Conjecture 15.5 we generalise this to non-boolean clause-sets. 



Conjecture 12.3 : nM is not far away fro m the /inM, more precisely, nM — 1 < 
/rnM < nM. The stronger Conjecture 12. 5| : the same holds even for the 
maximal number of full cl ause s, that is, nM(fc) — 1 < isfc(A4Us=k) < nM(fe) 
for all k £ N. In Lemma 15.3 we will state a weaker, but proven (in future 
work) lower bound. 

Five more conjectures will be presented in this final section. We asked also the 
following questions: 



1. Question 3.10 is about some complexity problems around the elimination of 
literal occurrences in minimally unsatisfiable clause-sets. 



2. Question 9.6 is about the complexity of SAT decision for S£ V. At first sight it 



might seem easy to translate every F £ CCS into some sat-equivalent element 
of < S£T>, and in fact to manipulate deficiency and surplus alone is rather easy, 
but we do not know how to handle them together. 



3. 



Question 10.12 concerns the existence of unsatisfiable hitting clause-sets of 
arbitrary surplus equal deficiency and a min-var-degree as low as possible. 
An underlying question is to understand better the quantity “surplus” . 



4. Question 11.2 is about strengthening the construction of Lemma 11.1, for find- 



ing matching-lean clause-sets of high minimum literal-degree (perhaps com- 
pletely different constructions are needed). 

In the remainder we outline main research areas related to the topics of the 
report. 

15.2 Improved upper bounds for /inM 

We know /inM < nMi, and we know /inM(fc) precisely for k £ {1, . . . , 6} and for 
k £ J, J + 1. Also of high relevance here is to determine ^vd(U'HITs=k) , which by 
Conjecture 12.2 is the same on /inM (k). Another major conjecture is Conjecture 



12! . which says that /inM deviates at most by 1 from nM. Beyond this article, we 



know the following improvements of the upper bound nMj: 



Generalising the ideas of Theorem 14.3, which is based on the improved upper 
bound for deficiency 2 3 — 3 + 1 = 6, we can show also for deficiency k = 
2 4 — 4 + 2 = 14 that we have /j,vd(M.U$-k) = nM(fe) — 1. Via the non- 
Mersenne operator, this yields the improved upper bound nM 2 . 

Altogether we obtain a sequence of improved upper bounds nM m _2 for to £ N, 
to > 3, improving the upper bound at deficiency k = 2 m — 2 for nM m _3 and 
applying the non-Mersenne operator. 



The infhnum of nMj, nM 2 , ... is nM w . This will be developed in 
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• However, this is not the end of it — also for deficiency k = 15 we have 
fivd(A4Us=k) = nM(fe) — 1, obtaining nM w +i. This new improvement depends 
on new ideas — will there be an infinite chain of ever-increasing complexity 
of such improvements? 

We believe that a closed “nice” formula for /rnM(fc) is impossible, but that 
however computation is possible: 



Conjecture 15.1 The function /inM : N — > N is “complex” , and for no finite tuple 
a holds NM(o) = /inM, however /inM is computable in doubly- exponential time. 



See Lemma 15.9 for some conditions which imply the computability-part of Con- 
jecture 15.1. 



15.3 Determining i4c(A 4Us=k) 



While Subsection 15.2 was about improving the upper bound, here now we turn 
to the lower bound. In Subsection |12.2 we provided only the minimum needed in 
this report for the measure fc(F) of full clauses. In the forthcoming |i 6 | we show 
the following lower bound, using S 2 : N — > N, the function for the “Smarandache 
Primitive Numbers” introduced in |)^, Unsolved Problem 47], which for k £ N is 
defined as the minimal natural number s G N such that 2 k divides s!. 

Lemma 15.2 ( ]|66| ) For all k G N holds uiciLCHITl—k) > S 2 {k). 



Lemma 15.2| yields the interesting inequality S 2 < /rnM < nM. This is relevant as 
the upper bound nM on S 2 as well as the lower bound S 2 on /rnM. From 
get that k + 1 < Szik) and thus by Corollary 7.22 we get 



we 



Lemma 15.3 ( [j66l ) k + 1 < S 2 (k) < nM(A;) < k + l + fld(fc) for k G N. 



Recall that in L emm a |l2.4 we obtained a much sharper lower bound for pnM(k) 
from Conjecture 12.3 . For sequences a, b : N — ¥ K let asymptotic equality be denoted 
by a ~ b lim„_ ) . 0 o ff- = 1 . 



Corollary 15.4 ( |66[| ) The six sequences 82 (h), vic(WHITs=k) , vic(M.lig=k) , 
/rvd(A iUs=k), /.tnM(fc), nM(fc) are asymptotically equal to (k)k^N (these are known 
facts for S 2 (k) and nM(fc ) ). 



In Figure we show the six quantities from Corollary 15.4 and the relations 
between them (we do not mention the dependencies on the deficiency k € N there). 
An arrow means a (proven) <-relation. If the arrow is labelled with “—1”, then we 
conjecture the difference is at most —1 (while in all three cases we know cases where 
the difference is equal to — 1 ), the label “=” means that we conjecture equality, and 
the label “ 00 ” means that we conjecture that the difference is unbounded. 

For a more precise asymptotic determination of these six quantities from Corol- 
lary 15.4 , calling them ak, we need to consider the six sequences as*; — k. Currently 
we only know nM(fc) — k ~ ld(fc). 



15.4 Generalisation to non-boolean clause-sets 



It is interesting to generalise Theorem 8.3 for generalised clause-sets; see |57], ^ 8 [ 
for a systematic study, while the most general notion of generalised clause-sets, 
“signed clause-sets” are discussed in [^|. Generalised clause-sets F have literals 
(v,e), meaning “v 7 ^ e”, for variables v with non-empty finite domains D v and 
values e e D v . The deficiency is generalised by giving every variable a weight 
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Figure 1: The four main combinatorial quantities, and the two numerical functions 



| A, | — 1 £ N 0 (which is 1 in the boolean case), i.e., 5(F) = c(F) - X)«evar(F)(IA'l — 
1) = C (F) + n(F) - E„ e var(F) I | ; see |i7[ Subsection 7.2]. A partial assignment 
is a map ip with some finite set of variables as domain dom(^) =: var (ip), which 
maps v £ var (ip) to ip(v) £ D v . A partial assignment <p satisfies a clause-set F 
iff for every C £ F there is (v,e) £ C with v £ var (ip) and tp(v) e. Minimally 
unsatisfiable (generalised) clause-sets are defined as usual (they are unsatisfiable, 
while every strict subset is satisfiable). In |57], Corollary 9.9] it is shown that also 
all minimally unsatisfiable generalised clause-sets F fulfil 6(F) > 1 (based, like in 
the boolean case, on matching autarkies). 

The degree vdF(u) of a variable v in a clause-set F is the sum of the degrees of 
the literals (v,e) for e £ D v , and thus v(1f(v) = \C £ F : C'fl ({u} x D v ) ^ 0|. For a 
given deficiency k £ N, the basic question is to determine the supremum of pvd(F) 
over all minimally unsatisfiable F with 5(F) = k. The base case of deficiency k = 1 
is handled in pg| , Lemma 5.4], showing that for generalised minimally unsatisfiable 
clause-sets of deficiency 1 we have /ivd (F) < max„ evar ( p) \ D v \ ; actually all structural 
knowledge from &EMU has been completely generalised in |58| , Subsection 5.2], 

But k > 2 requires more work, since here the basic method of saturation is not 
available for generalised clause-sets, as discussed in Subsection 5.1 in ^8|: saturated 
generalised clause-sets (i.e., unsatisfiable clause-sets, where no literal occurrence 
can be added without rendering the clause-set satisfiable) with deficiency at least 
2 after splitting do not necessarily generate minimally unsatisfiable (generalised) 
clause-sets. Thus the proofs for the boolean case seem not to be generalisable for 
arbitrary minimally unsatisfiable (generalised) clause-sets. 

In order to repair this, the “substitution stability parameter regarding irredun- 
dancy” sir(F) £ Z>_i U{+oo} is introduced in 1 58 , Subsection 5.3]), defined as 
the supremum of k £ Z>_ i such that for every partial assignment with n(ip) := 
|var(y>)| < k the clause-set tp * F, obtained as usual by application of p to F, 
is minimally unsatisfiable. So sir(F') > 0 iff F is minimally unsatisfiable, and as 
shown in |^8[ Corollary 4.8], sir(F) = +oo iff A is a hitting clause-set (i.e., for all 
C, D £ F, C ^ D, there are x £ C, y £ D with x = (v, e) and y = (v, e') for some 
variable v and e,e' £ D v with e ^ s'). And sir(F) > 1 iff splitting on any variable 
yields always a minimally unsatisfiable clause-set. So for a boolean clause-sets F 
holds sir(F) > 1 iff F is saturated, but for generalised clause-sets we only have that 
sir(F) > 1 implies saturatedness (psl. Corollary 5.3]). 
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In Corollary 5.10 in Q one finds a generalisation of the basic bound ^vd(F) < 
2 5(F) for the boolean case. Namely /rvd (F) < max„ evar ( f ) \D V \ ■ 5(F) is shown for 
F with sir (F) > 1. Since for (generalised) saturated F with 5(F) = 1 we have 
sir (F) = oo (|||, Corollary 5.6]), this covers the above mentioned result fivd(F) < 
max l)evar (^)|D„| for (arbitrary) minimally unsatisfiable F with 5(F) = 1 (note that 
here saturation works as in the boolean case) . 

In Q we concentrate on unsatisfiable hitting (generalised) clause-sets, and via 
generalised non-Mersenne numbers nM d (fc) we are able to generalise Theorem |S.3| 
to generalised clause-sets. We believe that in general the minimum variable degree 
of minimally unsatisfiable clause-sets F with sir(F) > 1 for a given defici ency is 
always obtained by unsatisfiable hitting clause-sets (generalising Conjecture 12.3 ): 



Conjecture 15.5 Let lATLTT$— k denote the set of generalised unsatisfiable hitting 
clause-sets of deficiency k £ N and with uniform domain-size d £ N, and let 
MUt =k sir >i be defined in the same way. Then we have for all k,d £ N that 
A ivd(UHlf 5 i k ) = /xvd(MWt fe , sir >i). 



Furthermore, the “2” in 82 (h) in Lemma 15.2 is related to the boolean domain, 
and generalising the results of this report in [ 66 to the non-boolean domain sheds 
light on Sd(k) (the minimal s £ N such that d k divides s!) for arbitrary prime 
numbers d £ N, as introduced in |)2[ Unsolved Problem 49] (while for non-prime- 
numbers d the definition of Sd has to be generalised). See Subsection III. 1 in |l9|] 
for basic properties of S p (k). 



15.5 Classification of M.U 

As mentioned in the introduction, a major motivation for us is the project of the 
clas sific ation of minimally unsatisfiable clause-sets in the deficiency (recall Examples 
3.2, |3.3|), where the main conjecture is: 



Conjecture 15.6 For every deficiency k £ N there are finitely many “ patterns ” 
which determine the nonsingular elements of MlLs=k, as well as the saturated and 
hitting cases amongst them. Especially for every k the isomorphism types of A4U' s _ k 
can be efficiently enumerated (without repetitions) , and for any given F £ 
its isomorphism type can be determined in polynomial time. 



Conjecture 15.6 has been shown for k < 2 (recall Examples |T 

the translation e : CCS 



cussed in Subsection 1.6.1 



3.3[ ). As we dis- 
TtyV has the property 
F £ MUs=k <=> e(F) £ MJVCs H =k-i for k £ N, and so the classification of A4Z4=fe 
can be seen as a subtask of the classification of MAfCs H =k for k £ No- The possibil- 
ity of a characterisation of MAfCs H =o was already raised in || (where concentration 
on the special case of saturated ( “strong” there) minimally non-2-colourable hyper- 
graphs was recommended) , but is indeed still outstanding, which is understandable, 
given that poly time decision of MUs=i is easy when compared with poly time deci- 
sion of MJ\fCs„=Q . In t he other direction, the consideration of MUs h = k C MUs<k 
(recall Subsection 1.6.5 ) could be a st eppin g stone (recall MUs h =i = UTHTs^i). 

A major step towards Conjecture 1 5 . (j should be the classification of unsatis- 
fiable hitting clause-sets in dependency on the deficiency. We remark here that 
unsatisfiable hitting clause-sets do not seem to have a close correspondence in hy- 
pergraph colouring, due to the lack of complementation in hypergraphs. Here the 
main conjecture (which should follow from Conjecture 15.6, once we found the pre- 
cise formulation of “finitely many patterns”) is: 
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Conjecture 15.7 For every deficiency k £ N there are only finitely many iso- 
morphism types of non-singular unsatisfiable hitting clause-sets, or equivalently, the 
number of variables of elements of WHTTg =k is bounded. 



For k < 2 finiteness has been established (Examples [1 . 2[ 3.3), while recently we 
were able to prove it for k = 3 ({37j). Assuming Conjecture 15.7, the question arises 
about the computability of the function, which maps k £ N to the set of isomorphism 
types. Equivalently one can consider the computability of any function, which maps 
k £ N to an upper bound on the number of variables of elements of UTHTg =k . It is 
conceivable that such functions must grow so quickly that they are not computable, 
we however believe that actually a very s mall b ound holds, and we conjecture the 
following strengthened form of Conjecture 15.7: 



Conjecture 15.8 For every k £ N and every F £ WHITg— k holds n(F) < 4 k — 5. 

This conjecture together with the other conjectures implies computability of 
/mM (using Corollary jT5|): 



Lemma 15.9 Assume that Conjecture EU holds. 



1. Then the map fc £ N 4 p'vdflA'HTTs^) is computable, by enumerating all 
possible clause-sets F with at most 4k — 5 variables, checking whether they are 
inUTLITg— k , and if so, including fj,vd(F) into the maximum- computation. 



2. If also Conjecture 12. j holds, then the function /znM is also computable. 



Conjecture 15.1 says additionally, that although /xnM is computable, it should be 
“complex” . 
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